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METHODS FOR TOANSFORMING PLANTS TO EXPRESS BAOLLUS THURINGIENSIS DELTA-ENIXm)XINS 

1.0 Background of the Invention 

1.1 Field of the Invention 

The present invention relates generally to transgenic plants having insecticidal capabilities, and to 
DNA constructs utilized to transfer genes conferring insect resistance into plant genomes. More 
specifically, the present invention relates to a method of expressing insecticidal proteins in plants 
transformed with a B. thuringiemis 6-endotoxin encoding gene, resulting in effective control of 
susceptible target pests. 

1.2 Description of Related Art 

1 .2.1 Methods of Controlling Insect Infestation in Plants 

The Gram-positive soil bacterium B. thuringiemis is well known for its production of 
proteinaceous parasporal crystals, or 8-endotoxins, that are toxic to a variety of Lepidopteran, 
Coleopteran. and Dipteran larvae. B. thuringiemis produces crystal proteins during sporulation which are 
specifically toxic to certain species of insects. Many different strains of B. thuringiemis have been shown 
to produce insecticidal crystal proteins. Compositions comprising A thuringiensis strains which produce 
proteins having insecticidal activity have been used commercially as environmentally-acceptable topical 
insecticides because of their toxicity to the specific target insect, and non-toxicity to plants and other non- 
targeted organisms. 

5-endotoxin crystals are toxic to insect larvae by ingestion. Solubilization of the crystal in the 
midgut of the insect releases the protoxin form of the 6-endotoxin which, in most instances, is 
subsequently processed to an active toxin by midgut protease. The activated toxins recognize and bind to 
the brush-border of the insect midgut epithelium through receptor proteins. Several putative crystal 
protein receptors have been isolated from certain insect larvae (Knight et al, 1995; Gill et al, 1995; 
Masson et al, 1995). The binding of active toxins is followed by interrelation and aggregation of toxin 
molecules to form pores within the midgut epithelium. This process leads to osmotic imbalance, 
swelling, lysis of the cells lining the midgut epithelium, and eventual larvae mortality. 
1.2.2 Transgenic B. thuringiensis 6-Endotoxins as Biopesticides 

Plant resistance and biological control are central tactics of control in the majority of insecticide 
improvement programs applied to the most diverse crops. With the advent of molecular genetic 
techniques, various 5-endotoxin genes have been isolated and their DNA sequences determined. These 
genes have been used to construct certain genetically engineered B. thuringiensis products that have been 
approved for commercial use. Recent developments have seen new 8-endotoxin delivery systems 
developed, including plants that contain and express genetically engineered 6-endotoxin genes. 
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Expression of B. thwingiensis S^ndotoxins in plants holds the potential for effective management of 
plant pests so long as certain problems can be overcome. These problems include the development of 
insect resistance to the particular Cry protein expressed in the plant, and development of morphologically 
abnormal plants because of the presence of the transgene. 

Expression of B. thuringieruiis 8-endotoxins in transgenic cotton, corn, and potatoes has proven 
to be an effective means of controlling agriculturally important insect pests (Perlak et al, 1990; Koziel et 
al, 1993; Perlak et al, 1993). Transgenic crops expressing* thurmgiemis 5-endotoxins enable growers 
to significantly reduce the application of costly, toxic, and sometimes ineffective topical chemical 
insecticides. Use of transgenes encoding B thuringiensis 5-endotoxins is particularly advantageous when 
insertion of the transgene has no negative effect on the yield of desired product from the transformed 
plants. Yields from crop plants expressing certain B thurmgiemis 5-endotoxins such as Cry 1 A or Cry3A 
have been observed to be equivalent or better than otherwise similar non-transgenic commercial plant 
varieties. This indicates that expression of some B thuringiensis 5-endotoxins does not have a 
significant negative impact on plant growth or development. This is not the case, however, for all B 
thuringiensis 5-endotoxins that may be used to transform plants. 

The use of topical B thuringiensis-deriytd insecticides may also result in the development of 
insect strains resistant to the insecticides. Resistance to Cr/1 A B thuringiensis S-endotoxins applied as 
foliar sprays has evolved in at least one well documented instance (Shelton et al., 1993). It is expected 
that insects may similarly evolve resistance to B thuringiensis 5-endotoxins expressed in transgenic 
plants. Such resistance, should it become widespread, would clearly limit the commercial value of com. 
cotton, potato, and other germplasm containing genes encoding B. thuringiensis 6-endotoxins. One 
possible way to both increase the effectiveness of the insecticide against target pests and to reduce the 
development of insecticide-resistant pests would be to ensure that transgenic crops express high levels of 
B thuringiensis 8-endotoxins (McGaughey and Whalon, 1993; Roush, 1994). 

In addition to producing a transgenic plant which expresses B thuringiensis 5-endotoxins at high 
levels, commercially viable B. thuringiensis genes must satisfy several additional criteria. For instance, 
expression of these genes in transgenic crop plants must not reduce the vigor, viability or fertility of the 
plants, nor may it affect the nonrial morphology of the plants. Such detrimental effects have two 
undesired results: they may interfere with the recove^^ and propagation of transgenic plants; they may 
also impede the development of mature plants, or confer unacceptable agronomic characteristics. 

There remains a need for compositions and methods useful in producing transgenic plants which 
express R thuringiensis 5-endotoxins at levels high enough to effectively control target plant insect pests 
as well as prevent the development of insecticide-resistant pest strains. A method resulting in higher 
levels of expression of the B thuringiensis 6-endotoxins will also provide the advantages of more 
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ftequent attainment of commercialiy viable transformed plant lines and more effective protection from 
infestation for the entire growing season. 

There also remains a need for a method of increasing the level of expression of A thwingiemis 6- 
endotoxins which does not simultaneously result in plant moqjhological changes that interfere with 
optimal growth and developmem of desired plant tissues. For example, the method of potentiating 
expression of the B. thuringiemis 8-endotoxins in corn should not result.in a com plant which cannot 
optimally develop for cultivation. Achievement of these goals such as high expression levels as well as 
recoveo' of morphologically normal plants has been elusive, and their pursuit has been ongoing and an 
important aspect of the long term value of insecticidal plant products. 
2.0 Summary of the Invention 

Described are novel methods for expressing Cry2A B. thwmgiensis 5-endotoxins which lack 
significant Dipteran inhibiting activity in transformed plants. This method advantageously results in both 
increased levels of expression of B. thuringiensis 8-endotoxins as well as a higher rate of recovery of 
morphologically-normal plants. 

By achieving high rates of expression, the present invention addresses another limitation of the 
prior art: development of insect resistance. Specifically, the instant invention provides a superior strategy 
for the delay or elimination of the development of resistance to Co'l A 5-endotoxins. the B. thuringiensis 
proteins most commonly expressed by transgenic lines. The disclosed methods involve expression of the 
Cry2A class of B. thuringiemis 5-endotoxins and particularly those that lack Dipteran-inhibiting activity. 
B. thuringiemis 6-endotoxins of the Cry2A group have no significant homology to CrylA-type 8- 
endotoxins and display distinct binding and pore-forming characteristics (English et al, 1994). and as 
such are expected to control insects that become resistant to, or that are not affected by. C^IA 5- 
endotoxins (Hofte and Whiteley, 1989). 

In preferred embodiments, the present invention provides an isolated and purified DNA construct 
comprising a Cry2A 5-endotoxin-encoding region localized to a plastid or chloroplast, or localized to a 
plam cell nuclear genome and operably linked to a region encoding a plastid transit peptide (PIP). 
Preferred DNA constructs of the present invention include those constructs that encode Cry2A 5- 
endotoxins lacking Dipteran-inhibitcry activity, though complete inactivity towards Dipterans is not 
required. In an illustrative embodiment, DNA constructs of the present invention encode a Cry2Ab 5- 
endotoxin operably linked to a DNA segment (or sequence) encoding a plastid transit peptide, which is 
one means of enabling localization of a Cry2Ab 5-endotoxin to a plastid or chloroplast. In certain 
embodiments, the Cry2Ab 5-endotoxin comprises the sequence of SEQ ID N0:2. The inventors 
contemplate, however, that any Co'2A 6-endotoxin lacking Dipteran-inhibitoo^ activity may be utilized 
according to the present invemion, with those bearing substantial homologies to Cry2Ab being 
particularly preferred. 
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In another embodiment, the DNA constructs of the present invention exploit nucleic acid 
segments encoding PTPs to potentiate expression of the 8-endotoxin. The use of one type of PTP, a 
chloroplast targeting peptide (CT?), in conjunction with a crylA B. thuringiensis transgene to promote 
expression of the transgene in the transformed plant is disclosed In U. S. Patent 5,500,365 (specifically 
incorporated herein by reference in its entirety). Where increased expression was observed, however, it 
was ascribed in part to the use of a new 5' untranslated leader sequence in the expression vector. 

In contrast to the prior art, the present invention discloses a structural DNA sequence that causes 
the production of an RNA sequence which encodes a targeted fusion pretein comprising an amino- 
terminal plastid transit peptide with a Cry2Ab 5-endotoxin; and a 3' non-translated DNA sequence which 
functions in plant cells to cause transcriptional termination and the addition of polyadenylated nucleotides 
to the 3' end of the RNA sequence. Surprisingly, this DNA construct results in increased levels of 
expression of the CryiA 5-endotoxin. The targeted fusion protein is non-active to all species, but is 
produced as a means for localizing the mature, insecticidally active 5-endotoxin protein to the chloroplast, 
yielding surprising and unexpected beneficial agronomic effects. 

One embodiment conceived of in the present invention is the introduction of a gene encoding a 
Cry2A 5-endotoxin lacking Dipteian activity into the chloroplast or plastid genome. Alternatively, a gene 
encoding a Cry2A 6-endotoxin lacking Dipteran activity could be expressed from an autonomously 
replicating episomal element located within the chloroplast or plastid. 

In another preferred embodiment, the invention provides for transgenic plants which have been 
transformed with an isolated and purified DNA construct that is translated and expressed at high levels by 
the plant. Both monocot and dicot plants may be transformed according to the methods and with the 
DNA constructs disclosed herein. The plant transformed by the instant invention may be prepared, in a 
f^irther preferred embodiment, by a process including obtainment of the isolated and purified DNA 
construct, and then transforming the plant with the construct so that the plant expresses the proteins for 
which the construct encodes. The inventors have observed that transformation of plants by the disclosed 
metiiods results in increased frequency of transformants which express the transgene, as well as the 
generation of more morphologically normal plants from initial transformants. 

It is contemplated that the increased expression levels observed in the disclosed invention will 
allow for reduced development of insect resistance to Bt 5-endotoxins. This may be achieved by 
transforming a plant with the preferred DNA construct to achieve high rates of Cry2A expression alone, 
or by simultaneously exposing target insects to CrylA and non-Dipteran active Ciy2A 8-endotoxins 
expressed in susceptible plants. Such insects include Ostrina spp., Diatraea spp.. Helicoverpa spp.. and 
Spodoptera spp.. in Zea mays; Heliothis virescem, Helicoverpa spp., Pectinophora spp.. in Gossypium 
hirsutvm; Anlicarsia spp.. Pseudoplusia spp., Epinotia spp., in Glycine max; and Scirpophaga wcertulas 
in Oryza sativa. 
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It is therefore contemplated that the method disclosed by the present invention will provide many 
advantages over the prior art including those specifically outlined above. TTiese advantages include 
obtammg improved control of susceptible insects; minimizing the development of insecticide-resistant 
msect strams; obtaining a greater number of commercially viable insect resistant plant lines; achieving 
season long protection from insect pathogens; and increasing the incidence of morphologically-„o™.al 
transformed plants. An additional advantage of the presem invention is that .^uced number, of 
transgenic lines would need to be produced in order to identify a transgenic event with nom,al growth 
characteristics. 

It is therefore contemplated that the method disclosed by the present invention will provide many 
advantages over the prior art including those specifically outlined above. TT,ese advantages include- 
obtammg improved control of susceptible insects; minimizing the development of insecticide-resistani 
msect strams; obtaining a greater number of commercially viable insect resistant plant lines; achieving 
season long protection from insect pathogens; and increasing the incidence of mon^hologically-normal 
transformed plants. An additional advantage of the present invention is that reduced numbers of 
transgenic lines would need to be produced in order to identify a transgenic event with nonnal growth 
characteristics. 

2.1 Nucleic Acid Compositions 

In one important embodiment, the invention provides an isolated and purified nucleic acid 
construct comprising a Co.2A coding region and a PTP coding region. These DNA constructs when 
transferred into a plant, undergo cellular processes resulting in increased expression of 5-endotoxins in 
the transgenic plant. The Co.2A endotoxins of the instant invention are preferably not effective against 
Dipteran species, though some adverse effects on Dipterans may be tolerated. In certain embodiments 
the DNA construct encodes a Dipteran-inactive Cry2Ab 5-endotoxin, and in more prefeired 
embodiments. Ae Cry2Ab 5-endotoxin has the polypeptide sequence of SEQ ID NO-2 or one 
substantially homologous to the polypeptide sequence of SEQ ID N0:2. Such nucleotide homologues 
may be greater than approximatefy 88% homologous, greater than about 90% homologous, greater than 
about 95% homologous, and even greater than about 99% homologous with the Co'2Ab 6-e„dotoxin 
disclosed in SEQ ID N0:2. Exemplary peptides include those that are about 88, 89. 90, 91. 92, 93, 94, 

95, 96. 97. 98 or even 99 or greater percem homologous to the Cry2Ab 5-endotoxin disclosed in SEQ ID 
NO:2. 

In even more preferred embodiments, the DNA construct of the present invention comprises a 
C.y2Ab 5-endotoxin-encoding region with the nucleic acid sequence of SEQ ID N0:1, or a sequence 
substantially homologous to that of SEQ ID N0:1. Also envisioned as within the scope of this invention 
are those DNA constructs having segments with substantial homologies to the nucleic acid sequence 
disclosed in SEQ ID NO:l. such as those which may be about 90% homologous, or about 95 % 
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homologous, or even about 99% homologous. More specifically, homologous nucleic acid sequences 
included in the present invention include those that are about 90, 91, 92 ,93, 94, 95, 96, 97, 98, and 99 
percent homologous to the nucleic acid sequence of SEQ ID NO: 1 . 

The DNA constructs provided herein also include a PTP coding region positioned upstream of the 
cry2A 6-endotoxin coding region and downstream of a promoter. These plastid transit peptide coding 
regions may encode any plant fiinctional PTP, and may operate to target encoded proteins to certain 
plastids within the plant cell, or to increase the expression of the 8-endotoxin for which the DNA 
construct encodes. In preferred embodiments, the present invention may include a PTP selected from the 
group including zmSSU, PTPl, PTPIA , and PTP2, or any other plant functional PTPs. More preferably, 
the plastid transit peptide coding region encodes a plastid transit peptide having the amino acid sequence 
of SEQ ID N0:4, SEQ ID N0:6, SEQ ID NO:8, or SEQ ID NO: 10, or any polypeptide sequence 
substantially homologous to these. Even more preferably, the instant invention comprises a plastid 
transit peptide coding region having the nucleic acid sequence of SEQ ID NO:3, SEQ ID NO:5, SEQ ID 
N0:7, or SEQ ID NO:9, or a nucleic acid sequence which is substantially homologous to these. 

Also, the inventors contemplate that the present invention would further achieve the goals of 
increased pathogenicity to pests, and result in decreased development of pesticide-resistant insects, if the 
DNA constructs provided herein were co-expressed along with other pesticidal compositions such as 
other proteins. Accordingly, the invention provides for use of the disclosed DNA constructs which 
further comprise plant-expressible coding regions for other Cry proteins. Included in these would be 
coding regions for Cry] proteins such as Cryl A, Cryl Ab, CrylBb, or Cryl chimeras (see co-pending US 
applications Serial No.'s 08/754,490 and 08/922,505, and co-pending PCT Application PCT/US97/17507 
based on US Application Serial No. 08/721,259, each specifically incorporated herein by reference in its 
entirety). 

In certain preferred embodiments, the DNA construct is an expression cassette which can be 
excised and isolated from said plasmid. 
2,2 Additional Nucleic acid Composition Elements 

The polynucleotide compositions of the present invention are useful in transforming both 
monocotyledonous and dicotyledonous plants. Accordingly, the DNA construct of the present invention 
may further comprise other various regulatory elements to aid in protein expression and to further 
facilitate introduction of the DNA construct into the plant. One example of this is the inclusion, in the 
DNA construct, of an intron positioned in the untranslated leader, upstream relative to the plastid transit 
peptide coding region. One useful leader sequence is the petunia heat shock protein. In various 
alternative embodiments, the intron may be any of the following: Adh intron 1, sucrose synthase intron, 
TMV omega element, maize heat shock protein (hsp) 70, or the rice Act] intron. In preferred 
embodiments, the intron is either maize heat shock protein 70 or petunia heat shock protein 70. 
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Provided in another preferred embodiment of the present invention is a polynucleotide sequence 
comprising a substantially Dipteran inactive ay2A 5-endotoxin coding region and a FTP coding region 
positioned under the control of a plant operable promoter. The use of a promoter is required for driving 
cellular processes so that expression of the gene is maximized. Preferred promoters include the 
following: CaMV 35S, histone, CaMV 19S. ms. OCS, Adh, sucrose synthase, a-tubulin, actin, cab, 
PEPCase, ssRUBISCO. ActI, Famv, enhanced FMV, or R-gene complex associated promoters. In more 
preferred embodiments, the promoter is the enhanced or duplicated CaMV 35S promoter (Kay et al, 
1987). In additional preferred embodiments, the promoter is the FMV35S promoter. Plant chloroplast or 
plastid functional promoters are also within the scope of the present invention. 

The present invention further contemplates the inclusion of a terminator region in the DNA 
construct to aid cellular processes involved with protein expression. In various embodiments, this 
terminator may be any of the following: the Agrobacterium tumefaciens nopaline synthase gene 
terminator, the Agrobacterium tumefaciens octopine synthase gene teiroinator, and the 3' end of the 
protease inhibitor I or II genes from potato or tomato. In an especially prefeired embodiment, the 
terminator is the Agrobacterium tumefaciens nopaline synthase gene terminator. 
23 Transformation Vectors 

Because the DNA construct of the presem invention is primarily, though not exclusively, 
intended for use in the transformation of plants, it is in certain preferred embodiments, contained within 
an expression vector. Such expression vectors may contain a variety of regulatory and other elements 
intended to allow for optimal expression of the desired proteins for which the expression vector encodes. 
These additional elements may include promoters, terminators, and introns as outlined above In section 
2.2. The vector containing the DNA construct and any regulatory or other elements may be selected from 
the group consisting of a yeast artificial chromosome, bacterial artificial chromosome, a plasmid, or a 
cosmid. 

Further, the expression vectors themselves may be of a variety of forms. These forms may differ 
for various reasons, and will likely be comprised of varying componems depending upon whether they are 
intended to transform a monocotyledonous plant or a dicotyledonous plant. For example, FIG. I 
illustrates one possible embodiment, where the monocotyledonous expression vector contains the crylMi 
gene in the plasmid designated as (SEQ ID NO:16). It is further contemplated that other expression 
vectors containing the expression cassettes embodied in these plasmid vectors, as well as expression 
cassettes containing substantial homologues, will also be useful transformation constructs. Accordingly, 
any transformation vector containing the nucleic acid sequence of from nucleic acid 1 78 1 to 5869 of SEQ 
ID NO: 16. 

FIG. 2 illustrates one possible dicotyledonous expression vector. It contains the co'2Ab gene 
embodied in the plasmids designated as pMON33827 (SEQ ID N0:13), pMON33828 (SEQ ID NO: 14), 
and PMON33829 (SEQ ID N0:15). As with the illustrative monocotyledonous transformation vectors. 
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the inventors further cor,ten,plate that other expression vectors containing the expression cassettes 
embodied in these plasmid vectors, or substantial homologues to those expression cassettes, will be useful 
as dicotyledonous transformation constructs. PrefermI dicotyledonous expression cassettes include those 
embodied by nucleic acids 17 to 3182 of SEQ ID N0:13; nucleic acids 17 to 3092 of SEQ ID NO 14 and 
nucleic acids 17 to 3155 of SEQ ID N0:15. Illustrative embodiments of vectors containing 'such 
expression cassettes are disclosed in the sequences designated herein as SEQ ID NO: 13, SEQ ID NO" 14 
and SEQ ID NO: 15. ' ' ' 

Vectors further envisioned to be within the scope of the present invention include those vectors 
capable of containing both the Dipteran-inactive cry2A nucleic acid compositions disclosed in section 2 1 
above, as well as any other DNA constructs which further comprise plant-expressible coding regions for 
other Cry proteins such as a Co'l protein. Vectors capable of containing both of these constructs may 
further comprise an internal ribosome entry sil« between the DNA construct; they may also contain a 
variety of different cistrons, rendering them polycistronicor multicistronic 
2.4 Transformed Host Cells 

Another preferred embodiment of the present invention encompasses cells transformed with the 
DNA constructs disclosed herein in sections 2.1 and 2.2, and by use of the transformation vectors 
d.sclosed in section 2.3. Transfonned cells contemplated in the present invention include both 
prokaryofc and eukaryotic cells which express the proteins encoded-for by the novel DNA constructs of 
the present invention. The process of producing transgenic cells is well-known in the art. In general the 
method comprises transforming a suitable host cell with a DNA segment which contains a pn,moter 
operatively linked to a coding region that encodes a B. thuringiemis 5-endotoxin. Such a coding region is 
generally operatively linked to a transcription-terminating region, whereby the promoter is capable of 
driving the transcription of the coding region in the cell, and hence providing the cell the ability to 
produce the 5-e„dotoxin in vivo. Alternatively, in instances where it is desirable to control, regulate, or 
decrease the amount of a particular 6-endotoxin or endotoxins expressed in a particular transgenic cell, 
the invention also provides for the expression of 5-endotoxin antisense mRNA; intron antisense mRNA- 
PTP antisense mRNA; or UTR antisense mRNA. The use of antisense mRNA as a means of controlling 
or decreasing the amount of a given protein of interest in a cell is well-known in the art. 

In a preferred embodiment, the invention encompasses a plant cell which has been transfomied 
with a nucleic acid segment or DNA constmct of the invention, and which expresses a gene or gene 
segment encoding one or more of the Dipteran-inactive Co'2A B. thurirrgiensis 5-endotoxins as disclosed 
herem. As used herein, the term "transgenic plant cell" is intended to refer to a plant cell that has 
mcorporated DNA sequences, including but not limited to genes which are perhaps not normally present 
DNA sequences not nonnally transcribed into RNA or translated into a protein ("expressed") or any 
other genes or DNA sequences which one desires to introduce into the non-transformed plant such as 
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genes which may normally be present in the non-transformed plant but which one desires to either 
genetically engineer or to have ahered expression. 

It is contemplated that in some instances the genome of a transgenic plant of the present invention 
will have been augmented through the stable introduction of a Dipteran-inactive Ciy2A B. thuringiemis 
5-endotoxin-encoding DNA constructs as disclosed in sections 2.1 and 2.2 above. In some instances, 
more than one transgene will be incorporated into the nuclear genome, or into the chloroplast or plastid 
genome of the transformed host plant cell. Such is the case when more than one crystal protein-encoding 
DNA segment is incorporated into the genome of such a plant. In certain situations, it may be desirable to 
have one. two, three, four, or even more B. thw-mgiensis crystal protein-encoding polynucleotides (either 
native or recombinantly-engineered) incorporated and stably expressed in the transformed transgenic 
plant. 

In preferred embodiments, the introduction of the transgene into the genome of the plant cell 
results in a stable integration wherein the offspring of such plants also contain a copy of the transgene in 
their genome. The heritability of this genetic element by the progeny of the plant into which the gene was 
originally introduced is a preferred aspect of this invention. A preferred gene which may be introduced 
includes, for example a B. thuringiensis 5-endotoxin. and pariiculariy one or more of those described 
herein. 

Means for transforming a plant cell and the preparation of a transgenic cell line are well-known in 
the art (as exemplified in U. S. Patents 5,550,318; 5,508,468; 5,482,852; 5.384,253; 5,276,269; and 
5,225,341, all specifically incorporated herein by reference in their entirety), and are briefly discussed 
herein. Vectors, plasmids, cosmids, YACs (yeast artificial chromosomes) and DNA segments for use in 
transforming such cells will, of course, generally comprise either the operons, genes, or gene-derived 
sequences of the present invention, either native, or synthetically-derived, and particularly those encoding 
the disclosed crystal proteins. These DNA constructs can further include structures such as premoters, 
enhancers, polylinkers, or even gene sequences which have positively- or negatively-regulating activitj 
upon the particular genes of interest as desired. The DNA segment or gene may encode either a native or 
modified crystal protein, which will be expressed in the resultant recombinant cells, and/or which will 
impart an improved phenotype to the regenerated plant. 

Transgenic cells specifically contemplated in the present invention include transgenic plant cells. 
Particularly preferred plant cells include those cells obtained from com, wheat, soybean, turf grasses, 
ornamental plant, fruit tree, shrubs, vegetables, grains, legumes, and the like, or any plam into which 
introduction of a Dipteran-inactive 5. thuringiemis 8-endotoxin transgene is desired. 
2.5 Transformed Plants 

In another aspect, plants transformed with any DNA construct of the present invention that 
express the proteins for which the construct encodes, are contemplated as being a part of this invention. 
Accordingly, the invention further provides transgenic plants which have been transformed with a DNA 
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construct. as disclosed herein in sections 2. 1 and 2.2, and transformed by use of transfonnation vectors as 
disclosed in section 2.3. Agronomic, horticultural, ornamental, and other economically or commercially 
useful plants can be made in accordance with the methods described herein, to express B. thuringiemis 5- 
endotoxins at levels high enough to confer resistance to insect pathogens while remaining 
morphologically normal 

Such plants may co-express the §-endotoxin polypeptide along with other antifungal, antibacterial, 
or antiviral pathogenesis-related peptides, polypeptides, or proteins; insecticidal proteins; proteins 
conferring herbicide resistance; and proteins involved in improving the quality or quantity of plant 
products or agronomic performance of plants. Simultaneous co-expression of multiple proteins in plants 
is advantageous in that it exploits more than one mode of action to control plant pathogenic damage. This 
can minimize the possibility of developing resistant pathogen strains, broaden the scope of resistance, and 
potentially result in a synergistic insecticidal effect, thereby enhancing a plant's ability to resist insect 
infestation (Intl. Patent Appl. Publ. No. WO 92/17591, 15 October 1992, specifically inco^jorated herein 
by reference in its entirety). 

The transformed plant of the current invention may be either a monocotyledonous plant or a 
dicotyledonous plant. Where the plant is a monocotyledonous plant, it may be any one of a variety of 
species. Preferred monocotyledonous species encompassed by the presem invention may include maize, 
rice, wheat, barley, oats, rye, millet, sorghum, sugarcane, asparagus, turfgrass, or any of a number of other 
grains or cereal plants. In preferred embodiments, the monocot is a maize plant. 

The present invention also contemplates a variety of dicotyledonous plants such as cotton, 
soybean, tomato, potato, citrus, tobacco, sugar beet, alfalfa, fava bean, pea, bean, apple, cheny. pear] 
strawberry, raspberry, or any other legume, tuber, or fruit plant. In preferred embodiments, the dicijt is a 
soybean plant, a tobacco plant, or a cotton plant. 

Many of the plants intended to be transformed according to the disclosed invention are 
commertjial crop plants. The commercial form of these plants may be the original plants, or their 
offspring which have inherited desired transgenes. Accordingly, plants further contemplated within the 
ambit of the presem invemion include any offspring of plants transformed with any of the permutations of 
the DNA construct which are noted in this application. Specifically, the offspring may be defined as an 
Ro transgenic plant. Other progeny of the transformed plam are also included within the scope of the 
present invention, including any progeny plant of any generation of the transformed plant, wherein the 
progeny plant has inherited the DNA construct from any Ro plant. 

Upon transformation with a specific DNA construct, the nucleic acid or polynucleotide segments 
of the construct may be incorporated in various portions into a chromosome of the transformant. 
Therefore, in another embodiment, the present invention encompasses any transgenic plant or plant cell 
prepared by the use of a DNA construct disclosed herein. Such a plant or cell encompassed by the present 
invemion includes those prepared by a process which has the following steps: (1) obtaining a DNA 



wo 00/26371 

PCT/US99/26086 

-11 - 

construct including a Diptexan-inactive C.y2A B. thuringiensis 5-endotoxin coding region positioned in 
frame and under the control of a promoter operable in the plant, and a plastid transit peptide coding region 
positioned upstream of the CrylA B. tkuringiensis 5-endotoxin coding region and downstream of the 
promoter; and (2) transfonning the plant with the obtained DNA construct, so that the plant expresses the 
Cry2A B. thuringiensis 5-endotoxin. The plant may also have been transformed so that it further 
mcorporates into its genome and expresses other Cry 5-endotoxins. 

In a related aspect, the present invention also encompasses a seed produced by the transformed 
plant, a progeny from such seed, and a seed produced by the progeny of the original transgenic plant 
produced in accordance with the above process. Such progeny and seeds will have a Dipteran-inactive B. 
thuringiensis 5-endotoxin transgene stably incorporated into its genome, and such progeny plants will 
mherit the traits afforded by the introduction of a stable transgene in Mendelian fashion All such 
transgenic plants having incorporated into their genome transgenic DNA segments encoding any DNA 
construct disclosed herein, particularly those disclosed in sections 2.1 and 2.2 ar« aspects of this 
invention. 

Recombinant plants, cells, seeds, and other tissues could also be produced in which only the 
mitochondrial or chloroplast DNA has been altered to incorporate the molecules envisioned in this 
application. Promoters which function in chloroplasts have been known in the art (Hanley-Bowden et al 
Trends m Biochemical Sciences 12:67-70. 1987). Methods and compositions for obtaining cells 
containing chloroplasts into which heterologous DNA has been inserted has been described by Daniel! et 
al., U.S. Pat. No. 5.693,507 (1997). 
2.6 Plant Transformation Methods 
2.6.1 Method of Expressing a Cry2A 5-Endotoxin in a Plant 

In another preferred embodiment, the presem invention provides a method for expressing 
Dipteran-inactive Cry2A B thuringiensis 6-endotoxins at high levels in transgenic plants. TTie disclosed 
methods may exploit any of the DNA constructs disclosed in sections 2.1 and 2.2 above, as well as any of 
the transformation vectors disclosed, for example, in section 2.3 above. TT,e contemplated methods 
enable Cry2A 5-endotoxins, an alternative to CrylA B thuringiensis 5-endotoxins for the control of 
several msect pests, to be expressed in plants without negatively affecting the recovery of agronomic 
qualities of transgenic plants. The invention described herein also enables expression of C.y2A 5- 
endotoxms at levels up to 25 times higher than that achieved by current methods. 

The method described here thus enables plants expressing Ciy2A to be used as either an 
alternative or supplement to plants expressing Cry 1 A-type B thuringiensis 6-endotoxins for both control 
and resistance management of key insect pests, including Ostrina sp, Diatraea sp„ Helicoyerpa sp, 
Spodoptera sp in Zea mays; Heliothis .irescens , Helico.erpa sp, Pectinophora sp. in Gossypiuin 
hrsutum; and Anticarsia sp. Pseudoplusia sp, Epinotia sp in Glycine ma.. It is also contemplated that the 
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methods described may be used to dramatically increase expression of B. thuringiensis 6-endotoxins 
including and related to C^^A, thus increasing its effectiveness against target pests and decreasing the 
likelihood of evolved resistance to these proteins. In one embodiment of the present invention, the 
Cry2Ab 8-endotoxin is expressed. Target pests of this protein and their common hosts are shown bllow 
in Table 1. 

Table 1 

Cry2Ab Target Pests and Common Plant Hosts of those Pests 



Hosts 



Reference 



Ostrina nubialis 
Diatraea grandiosella 
Helicoverpa zea 
Heliothis virescens 
Pectinophora gossypiella 
Anticarsia gemmatalis 
Psetidoplusia includens 
Epinotia t^orema 



Zea mays 

Gossypium hirsutum 
Glycine max 



Donovan 

U. S. Patent 5,338,544 



The method of expressing a Cry2A B. thuringiensis 6-endotoxin in a plant disclosed herein 
includes the steps of: (I) obtaining nucleic acid segment comprising a promoter operably linked to a first 
polynucleotide sequence encoding a plastid transit peptide, and a second polynucleotide sequence, 
encoding a Cry2A B. thuringiensis 5-endotoxin lacking Dipteran activity, to yield a fusion protein 
comprised of an amino-terminal plastid transit peptide and a Cry2A B. tlmringiensis 6-endotoxin lacking 
Dipteran activity; and (2) transforming the plant with the DNA construct of step 1 so that the plant 
expresses the protein fusion. In a preferred embodiment, the nucleic acid segment employed in step (I ) of 
this meAod is structured so that the 5' end of the second polynucleotide sequence is operably linked in the 
same translational reading frame to the 3' end of the first polynucleotide sequence. 

The plant or plant cell transformed by the method disclosed herein may be either a 
monocotyledonous plant or a dicotyledonous plant. Wher* the plant is a monocotyledonous plant, it may 
be any one of a variety of species. Preferred monocotyledonous species encompassed by the present 
invention may include maize, rice, wheat, barley, oats, rye, millet, sorghum, sugarcane, asparagus 
turfgrass, or any of a number of other grains or cereal plants. In preferred embodiments, the monocot is a 
maize plant. 

The present invention also contemplates a process by which a variety of dicotyledonous plants or 
plant cells are transformed. Such dicotyledonous plants may include plants such as cotton, soybean 
tomato, potato, citrus, tobacco, sugar beet, alfalfa, fava bean, pea, bean, apple, cheny, pear, strawberry' 
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raspbeny, or any other legume, tuber, or fruit plant. In preferred embodiments, the dicot is a soybean 
plant, a tobacco plant or cell, or a cotton plant or cell. 

2.6^ Method ofcxpressing a Cry2Ab5-endotoxiii in a Progeny Plant 

As noted with regard to other embodiments disclosed in the present invention, many of the plants 
intended to be transformed according to the disclosed invention are commercial crop plants. The 
commercial form of these plants may be the original plants, or their offspring which have inherited 
desired transgenes. Accordingly, the inventors further contemplate that the method disclosed herein 
includes a method of producing a transgenic progeny plant or progeny plant cell. The method of 
producing such progeny includes: The method of expressing a Cry2A B. thuringiemis 6-endotoxin in a 
plant disclosed herein includes the steps of: (1) obtaining nucleic acid segment comprising a promoter 
operably linked to a first polynucleotide sequence encoding a plastid transit peptide, and a second 
polynucleotide sequence, encoding a C,y2A B. thuringiemis 5-endotoxin lacking Dipteran activity, to 
yield a fusion protein comprised of an amino-terminal plastid transit peptide and a Cry2A B thuringiemis 
5-endotoxin lacking Dipteran activity; (2) obtaining a second plant; and (3) crossing the first and second 
plants to obtain a crossed transgenic progeny plant or plant cell which has inherited the nucleic acid 
segments from the first plant. The present invention specifically encompasses the progeny, progeny plant 
or seed from any of the monocotyledonous or dicotyledonous plants, including those noted in sections 2.5 
and 2.6.1 above. 

2.6.3 Method of Co-Expressing Cry2Ab and other Cry B. thuringiensis S-endotoxins in a 
Plant and a Progeny Plant 

In another preferred embodiment, the method of expressing the Dipteran-inactive Ciy2A B. 
thuringiensis 8-endotoxin disclosed herein includes co-expression of the disclosed DNA construct in any 
of its various embodiments, along with a Cryl R thuringiensis 5-endotoxin. The metiiod of expressing 
these Cry B thuringiensis 5-endotoxins together is expected to achieve increased insecticidal properties in 
the transformed plant through increased expression and decreased developmem of insect resistance - all of 
which are desired results not present in existing technologies. This co-expression may be in the original 
transformant, or in any number of generations of progeny of the original transformant which have 
inherited the genes to co-express the proteins encoded for by any of the DNA constructs disclosed herein. 
3.0 Brief Description of the Drawings 

The following drawings form part of the present specification and are included to further 
demonstrate certain aspects of the present invention. The invention may be better understood by 
reference to one or more of these drawings in combination with the detailed description of specific 
embodiments presented herein. 

FIG. 1. Schematic illustration of elemems of monocot plant cry2Ah expression vectors 
pMON30464, pMON30463, and pMON26800. 
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FIG. 2. Schematic illustration of elements of dicot c/y2Ab expression vectors pMON33830, 
PMON33827, pMON33828, and pMON33829. 

FIG. 3. Schematic illustration of elements of dicot crylk^ expression vectors pMON33803 
PMON33812, PMON3381 1, and pMON33806. 

FIG. 4. Plasmid designated pMON30464. 

FIG. 5. Plasmid designated pMON33827. 

FIG. 6. Plasmid designated pMON33828. 

FIG. 7. Plasmid designated pMON33829. 

4.0 Description of Illustrative Embodiments 

The following detailed description of the invention is provided to aid those skilled in the art in 
practicing the present invention. Even so, the following detailed description should not be construed to 
unduly limit the present invention as modifications and variations in the embodiments discussed herein 
may be made by those of ordinary skill in the art without departing from the spirit or scope of the present 
inventive discovery. 

4.1 Identification of Sequences 

SEQ ID NO:l. Nucleic acid sequence of a crv2Ab gene. 
SEQ ID NO:2. Amino acid sequence of a Cry2Ab B. thwingiensis 6- 
endotoxin. 

SEQ ID NO:3. Nucleic acid sequence of a zmSSU plastid transit peptide. 
SEQ ID NO:4. Amino acid sequence of a zmSSU plastid transit peptide. 
SEQ ID NO:5. Nucleic acid sequence of a plastid transit peptide 1 (PTPl). 
SEQ ID NO:6. Amino acid sequence of a PTPl . 
SEQ ID NO:7. Nucleic acid sequence of a plastid transit peptide lA 
(PTPIA). 

SEQ ID NO:8. Amino acid sequence of a PTPIA. 

SEQ ID NO:9. Nucleic acid sequence of a plastid transit peptide 2 (PTP2). 

SEQ D) NO:I0. Amino acid sequence of a PTP2. 

SEQ ID NO:H. Nucleic acid sequence of a co'2Aa gene. 

SEQIDNO:12. Amino sequence ofaCry2Aa polypeptide. 

SEQIDNO:13. pMON33827. 

SEQIDNO:14. pMON33828. 

SEQIDNO:15. pMON33829. 

SEQIDNO:16. pMON30464. 

SEQ ID NO: 17. Bacillus thuringiensis crylAh gene sequence, UWGCG 
accession number M23724 (Widner and Whiteley). 
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SEQ ID NO:18. Bacillus thurmgiensis crylAb amino acid sequence 
translated from SEQ ID NO: 1 7. 

4.2 Definitions 

The following words and phrases herein have the meanings as set forth below. 
Biological functional equivalents. As used herein such equivalents with respect to the 
msecfcidal proteins of the present invention are peptides, polypeptides and proteins that contain a 
sequence or moiety exhibiting sequence similarity to the novel peptides of the pi^sent invention, such as 
Cry2Ab, and which exhibit the same or similar functional properties as that of the polypeptides disclosed 
herem, including insecticidal activity. Biological equivalents also include peptides, polypeptides and 
protons that react with, i.e. specifically bind to antibodies raised against Cry2Ab and that exhibit the 
same or s.m.lar insecticidal activity, including both monoclonal and polyclonal antibodies. 

Chloroplast or plastid localized, as used herein, refers to a biological molecule, either 
polynucleotide or polypeptide, which is positioned within the chloroplast or plastid such that the molecule 
.3 isolated from the cellular cytoplasmic milieu, and functions within the chloroplast or plastid cytoplasm 
to provide the effects claimed in the instant invention. Localization of a biological molecule to the 
chloroplast or plastid can occur, with reference to polynucleotides, by artificial mechanical means such as 
electroporation, mechanical microinjection, or by polynucleotide coated microprojectile bomba«iment or 
wtth reference to polypeptides, by secretory or import means wherein a natural, synthetic, or heterologous 
plastid or chloroplast targeting peptide sequence is used which functions to target, insert, assist, or 
localize a Imked polypeptide into a chloroplast or plastid. 

Combating or Controlling Insect Damage in an agricultural context refers to reduction in 
damage to a crop caused by infection by an insect pest More generally, this phrase refers to reduction in 
the adverse effects caused by the presence of an undesired insect in any particular location. 

Event rekrs to a transgenic plant derived from the insertion of foreign DNA into one or more 
unique sites in the nuclear genomic DNA. 

Expression: TT,e combination of intracellular processes, including transcription, translation, and 
other mtracellular protein and RNA processing and stabilization functions, undergone by a coding DNA 
molecule such as a structural gene to produce a polypeptide. 

Insecticidal polypeptide refers to a polypeptide having insecticidal properties, e.g., a polypeptide 
which mhtbits the growth, development, viability or fecundity of target insect pests. 

Operably Linked: Nucleic acid coding segments connected in fmme so that the properties of 
one mfluence the e)q)ression of the other. 

Plant-Expressible Coding Regions: Coding regions which are expressible in planta because 
they contam typical plant regulatory elements to facilitate the expression of the gene of interest. 

Plastid Transit Peptide: Any amino acid sequence useful in targeting or localizing a linked 
ammo ac.d, such as a protein fusion, to a subcellular compartmem or organelle such as a plastid 
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Progeny: "Progeny" includes any offspring or descendant of the transgenic plant, or any 
subsequent plant which has the transforraant in its lineage. Progeny is not limited to one generation, but 
rather encompasses the transformant's descendants so long as they contain or express the transgene. 
Seeds containing transgenic embryos as well as seeds from the transgenic plants and their offspring or 
descendants are also important parts of the invention. 

Promoter: A recognition site on a DNA sequence or group of DNA sequences that provide an 
expression control element for a structural gene and to which RNA polymerase specifically binds and 
initiates RNA synthesis (transcription) of that gene. 

Ro is the primary regenerant plant derived from transformation of plant tissue or cells in culture. 
Subsequent progeny or generations derived from the Ro are referred to as R, (first generation). R, (second 
generation), etc. 

Regeneration: The process of growing a plant from a plant cell (e.g., plant protoplast or 

explant). 

stably maintained within a plant plastid or chloroplast refers to the introduction by 
electroporation, transformation, transduction, or micelle or liposome-like fusion of a polynucleotide or 
nucleic acid into a chloroplast or plastid in such a way that the nucleic acid remains within the recipient 
chloroplast or plastid and within all subsequent progeny of the recipient chloroplast or plastid either by 
incorporation by recombination into the chloroplast or plastid genome, or as an autonomously replicating 
covalently closed circular replicon residing within the chloroplast or plastid by means of growth of any 
plant, plant cell, or plant tissue containing such transformed chloroplast or plastid and while in the 
presence of a chemical or compound which requires one or more genes present on and expressed from the 
replicon in order to ensure the survival of the transformed plastid or chloroplast and their progeny piastids 
or chloroplasts within the plant, plant cell, or plant tissue. 

Structural Coding Sequence refers to a DNA sequence that encodes a peptide, polypeptide, or 
protein that is made by a cell following transcription of the structural coding sequence to messenger RNA 
(mRNA), followed by translation of the mRNA to the desired peptide, polypeptide, or protein product 

Structural gene: A gene that is expressed to produce a polypeptide. 

Substantial homology: As this term is used herein, it refers to nucleic acid or polypeptide 
sequences which are about 86% homologous, to about 90% homologous, to about 95% homologous, to 
about 99% homologous. More specifically, the inventors envision substantial homologues to be about 86 
87. 88. 89, 90. 91. 92. 93. 94. 95. 96. 97. 98. and 99 percent homologous to the referent nucleic acid 
sequence of polypeptide. 

Substantial temporal or spatial regulation refers to the expression of a gene within a plant or 
plant tissue from a plant operable promoter. With reference to temporal regulation, a promoter may be 
regulated for expression only during specific times during plant cell or tissue or even whole plant growth 
and development. A promoter which is actively expressing one or more genes only during seed 
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germination would be one example of temporal regulation. Other examples could include promoters 
which are actively expressing one or more genes only during times when the plant, plant cell or plant 
tissue is exposed to certain light intensities or during total darkness. Substantial temporal regulation 
refers to a promoter which is actively expressed at a certain time but which may or may not be completely 
suppressed at other times, such that expression may still be detected by monitoring for the presence of 
some indicator such as an enzyme produced from a coding sequence linked to such promoter, or as 
measured by the increase or decrease in some gene product such as an mRNA produced at various times 
throughout plant growth, differentiation, and development and/or in response to various environmental 
stimuli. Substantial spatial regulation refers to the expression of a gene linked to a promoter from which 
expression proceeds only during growth and development of certain cells or tissues within a plant For 
example, a tapetal promoter would only be expected to be expressed during flower growth and 
development. Similarly, a root specific or root enhanced promoter would only be expected to be 
expressed from within root cells or root tissues. Substantially spatially regulated also refer, to the level of 
expression from a particular tissue specific promoter in that particular tissue and as related to levels of 
expression from that or a similar promoter in other tissues, wherein expression may also be detected in 
tissues other than the particular tissue in which the promoter expression is preferred, but at significantly 
lower expression levels as measured by the production of an enzyme produced from a coding sequence 
hnked to the promoter or by the appearance of some detectable gene product. Promoters can also be both 
substantially temporally and substantially spatially regulated together and simultaneously in a 
coordinately regulated manner. 

Synthetic gene: Synthetic genes encoding the B. thuringiemis 5-endotoxins of the presem 
mvention are those prepared in a manner involving any sort of genetic isolation or manipulation This 
mcludes isolation of the gene from its naturally occurring state, manipulation of the gene as by codon 
modification (as described herein), or site-specific mutagenesis (as described herein), truncation of the 
gene or any other manipulative or isolative method. 

Terminator: The 3' end transcription termination and polyadenylation sequence. 

Transformation: A process of introducing an exogenous DNA sequence (e.g., a vector or a 
recombinant DNA molecule) into a cell or protoplast in which that exogenous DNA is incorporated into a 
chromosome or is capable of autonomous replication. 

Transformed cell: A cell which has been altered by the introduction of one or more exogenous 
DNA molecules into that cell. 

Transgene: A gene construct or DNA segment comprising a gene which is desired to be 
expressed m the recipient cell, tissue or organism. This may include an entire plasmid, or other vector or 
may simply include the functional coding section, region, domain, or segment of the transfen^d DNA 
construct. 
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Transgenic cell: Any cell derived or regenerated from a transformed cell or derived from a 
transgenic cell. Exemplao' transgenic cells include plant calli derived from a transformed plant cell and 
particular cells such as leaf, root. stem, e.g., somatic cells, or reproductive (germ) cells obtained from a 
transgenic plant. 

Transgenic event: A plant or progeny thereof derived from the insertion of foreign DNA into 
the nuclear genome of a plant ceil or protoplast. 

Transgenic plant: A plant or progeny thereof which has been genetically modified to contain 
and express heterologous DNA sequences as proteins. As specifically exemplified herein, a transgenic 
soybean plant is genetically modified to contain and express at least one heterologous DNA sequence 
operably linked to and under the regulatory control of transcriptional control sequences which function in 
plant cells or tissue or in whole plants. A transgenic plant may also be referred to as a transformed plant 
A transgenic plant also refers to progeny of the initial transgenic plant where those progeny contain and 
are capable of expressing the heterologous coding sequence under the regulatory control of the plant- 
expressible transcription control sequences described herein. 

Vector: A DNA molecule capable of replication in a host cell and/or to which another DNA 
segment can be operatively linked so as to bring about replication of the linked segment. A plasmid is an 
exemplary vector. 

43 SYNTHESIS AND ISOLATION OF A NUCLEIC ACID SEGMENT ENCODING A B. THURINGIENSIS 
6-ENDOTOXIN AND PLASTID TARGETING SEQUENCES 

The presem invention discloses novel DNA constructs comprising polynucleotide sequences 
encoding B. thuringiensis 5-endotoxins, as well as plastid targeting sequences. Methods for the 
construction and expression of synthetic B. thuringiensis genes in plants are well known by those of skill 
m the art and are described in detail in U. S. Patent 5,500,365. The present invention contemplates the 
use of Cry2A B. thuringie»sis genes in the transformation of both monocotyledonous and dicotyledonous 
plants. To potentiate the expression of these genes, the present invention provides DNA constructs 
compnsmg polynucleotide segments encoding plastid targeting peptides positioned upstream of the 
polynucleotide sequences encoding the desired R thuringiensis 6-endo.oxins. In particular, sequences 
encoding B thuringiensis 5-endotoxins lacking substantial Dipteran species inhibitory activity are 
contemplated. 

4.4 Probes and Primers 

In one aspect, nucleotide sequence information provided by the invention allows for the 
preparation of relatively short DNA sequences having the ability to specifically hybridize to gene 
sequences of the selected polynucleotides disclosed herein. In these aspects, nucleic acid probes of an 
appropriate length are prepared based on a consideration of selected polypeptide sequences encoding 
Cry2A 8-endotoxin polypeptides, e.g., a sequence such as that shown in SEQ IDNO:!. These nucleic 
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acid probes may also be prepared based on a consideration of selected polynucleotide sequences encoding 
a plastid targeting peptide, such as those shown in SEQ ID N0:3, SEQ ID N0:5, SEQ ID N0:7 and SEQ 
ID NO:9. The ability of such nucleic acid probes to specifically hybridize to a gene sequence encoding a 
5-endotoxin polypeptide or a plastid targeting peptide sequence lends to them particular utility in a variety 
of embodiments. Most importantly, the probes may be used in a variety of assays for detecting the 
presence of complementary sequences in a given sample. 

In certain embodiments, it is advantageous to use oligonucleotide primers. The sequence of such 
primers is designed using a polynucleotide of the pi^sent invention for use in detecting, amplifying or 
mutating a defined segment of a crystal protein gene from B. thuringiensis using PCR™ technology The 
process may also be used to detect, amplify or mutate a defined segment of the polynucleotide encoding a 
plastid targeting peptide. Segments of genes related to the polynucleotides encoding the 5-endotoxin 
polypeptides and plastid targeting peptides of the present invention may also be amplified by PCR- 
using such primers. 

To provide certain of the advantages in accordance with the present invention, a preferred nucleic 
acd sequence employed for hybridization studies or assays includes sequences that are complementary to 
at least a 14 to 30 or so long nucleotide stretch of a polynucleotide sequence encoding a crystal protein 
such as that shown in SEQ ID NO: 1 , or sequences that are complementary to at least a 14 to 30 or so long 
nucleofde stretch of a sequence encoding a plastid targeting peptide, such as those shown in SEQ ID 
NO:3, SEQ ID N0:5, SEQ ID NO:7, and SEQ ID N0:9. 

A size of at least 14 nucleotides in length helps to ensure that the fragment will be of sufficient 
length to form a duplex molecule that is both stable and selective. Molecules having complementary 
sequences over segments greater than 14 bases in length are generally preferred. In order to increase 
stability and selectivity of the hybrid, and thereby improve the quality and degree of specific hybrid 
molecules obtained, one will generally prefer to design nucleic acid molecules having gene- 
complementary sequences of 1 4 to 20 nucleotides, or even longer where desired. Such fragments may be 
readrly prepared by, for example, directly synthesizing the fragment by chemical means, by application of 
nucleic acid reproduction technology, such as the PCR™ technology of U. S. Patents 4,683,195 and 
4,683,202 (each specifically incoq^orated herein by reference), or by excising selected DNA fragments 
from recombinant plasmids containing appropriate inserts and suitable restriction sites. 
4^ Expression Vectors 

The present invention also contemplates an expression vector comprising a polynucleotide of the 
present invention. Thus, in one embodiment an expression vector is an isolated and purified DNA 
molecule comprising a promoter operatively linked to a coding region that encodes a polypeptide of the 
present invention, which coding region is operatively linked to a transcription-terminating region 
whereby the promoter drives the transcription of the coding region. The coding region may include a 
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segment encoding a B. thuringiensis 5-endotoxin and a segment encoding a plastid target peptide. The 
DNA molecule comprising the expression vector may also contain a functional intron 

As used herein, the terms "operatively linked" or "operably linked" mean that a promoter is 
connected to a coding region in such a way that the transcription of that coding region is controlled and 
regulated by that promoter. Means for operatively linking a promoter to a coding region to regulate both 
upstream and downstream are well known in the art. 

Preferred plant transformation vectors include those derived from a Ti plasmid of Agrobacterium 
tumefaciens, as well as those disclosed. e.g., by Herrera-Estrella (1983), Bevan (1983), Klee (1985) and 
Eur. Pat Appl. No. BP 0120516 (each specifically incorporated herein by reference). 

Promoters that function in bacteria are well known in the art. Exemplary and preferred promoters 
for the B. thuringiensis crystal proteins include the sigA, sigE, and sigK gene promoters. Alternatively 
native, mutagenized. heterologous, or recombinant crystal protein-encoding gene promoters themselves 
can be used. 

Where an expression vector of the present invention is to be used to transform a plant, a 
promoter is selected that has the ability to drive expression in that particular species of plant. Promoters 
that function in different plant species are also well known in the art. Promoters useful in expressing the 
polypeptide in plants are those which are inducible, viral, synthetic, or constitutive as described (Odell et 
al, 1985), and/or temporally regulated, spatially regulated, and spatio-temporally regulated. Preferred 
promoters include the enhanced CaMV35S promoters, and the FMV35S promoter. 
4.5.1 Vectors with Plastid Targeting Peptide-Encoding Segments 

In accordance with the present invention, expression vectors designed to specifically potentiate 
the expression of the polypeptide in the transformed plant may include certain regions encoding plastid 
targeting peptides (PTP). These regions allow for the cellular processes involved in transcription, 
translation and expression of the encoded protein to be fully exploited when associated with certain B 
thuringiensis 5-endotoxins. Such plastid targeting peptides function in a variety of ways, such as for 
example, by transferring the expressed protein to the cell structure in which it most effectively opemtes, 
or by transferring the expressed protein to areas of the cell in which cellular processes necessary for 
expression are concentrated. 

The use of PTPs may also increase the frequency of recovery of morphologically normal plants, 
and the frequency at which transgenic plants may be recovered. Given that commercially viable 
expression of both QylA and Cry3A-type B thuringiensis 5-endotoxins have been achieved by 
expression of forms of the proteins that remain localized in the cytosol (/.e. non-targeted forms) 
expression of non-targeted forms of both C.y2Aa and C.y2Ab were also initially attempted in transgenic' 
cotton, tobacco, and com. 

In com, non-targeted Cry2Ab expression transformation vectors yield relatively few transgenic 
events {i.e. independem insertion events into the com genome) with Cry2Ab expression levels sufficiem 
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for commercially acceptable insect control. Moreover, many of the com transformants expressing non- 
targeted Cry2Ab exhibited obvious growth defects such as severe reduction in stature (stunting) or severe 
yellowing of the leaves (chlorosis) that rendered the plants commercially unacceptable. Expression levels 
of non-targeted Cry2Ab in com were no higher than approximately 15 ppm, a level minimally required 
for Cry2Ab-mediated control of European com borer (ECB). 

Although studies involving expression of plastid targeted CrylA-type B. thuringiensis 5- 
endotoxins in transgenic plants have been described (Wong et ai, 1992) , targeting of the non- 
homologous Cry2A or Cry2A proteins has not previously been described. One report of plastid targeted 
Cry! Ac expression indicated that such targeting j-esults in little or no increase in Ciyl Ac expression (U.S. 
Patent No. 5,500^65). Another report indicated that an increase in expression of a plastid targeted fomi 
of Cryl Ac required the inclusion of a new 5' untranslated leader sequence (Wong et al., 1992) and that 
the effect of the leader and targeting sequences on expression was highly dependent on the coding 
sequence of the structural gene. Wong et al. concluded that inclusion of both the leader sequence and 
plastid transit peptide increased Cry! Ac expression 18-foId, but the same sequences increased p- 
glucuronidase expression only 6-fold. Finally, none of the previous reports predicted that plastid 
targeting would result in increased recovery of morphologically normal B. thuringiensis expressing 
plants. 

The present invention discloses that transgenic com plants expressing Dipteran inactive Cry2A 
6-endotoxins, such as CryZAb. at levels up to 10-fold higher than required for ECB control were 
recovered at significantly higher frequencies when a plastid targeted form of the Cry2A was used. In the 
case of Cry2Ab, elevated expression is critical in obtaining transgenic com with ECB control since the 
LCso of Cty2Ab against ECB is significantly higher than the LC50 ECB of the CrylAb B. thuringiensis 
currently used to control ECB in transgenic com (U. S. Patent 5,338,544, 1994; Macintosh et al., 1990; 
Armstrong era/., 1995). 

Increased expression is also especially valuable in that it provides additional protection against 
development of resistance v/fl a high dose strategy (McGaughey and Whalon, 1993; Roush, 1994). High 
level expression is even further desirable as it provides sustained insect protection in instances where 
insecticidal gene expression decreases due to environmental conditions. Additionally and unexpectedly, 
com plants transformed with plastid targeted Cry2Ab expression vectors exhibited normal growth and 
development. 

A significant distinction between targeted and non-targeted (cytosolic) expression of Cry2Ab was 
the dramatic increase in levels of Cry2Ab protein in plants transformed with the plastid targeted Cry2Ab 
expression vector relative to plants transformed with the cytosolic Cry2Ab vector. This result was very 
unexpected. Also, in contrast to the teachings of previous work, the invention disclosed herein reveals 
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that enhanced recovery of phenotypically nornial transgenic plants can be achieved using the disclosed 
methods of plastid targeted expression. 

An example of a plastid targeting peptide (PTP) is a chloroplast targeting peptide. Chloroplast 
targeting peptides have been found particularly useful in the glyphosate resistant selectable marker 
system. In this system, plants transformed to express a protein conferring glyphosate resistance are 
transformed with a PTP Aat targets the peptide to the cell's chloroplasts. Glyphosate inhibits the 
shikimic acid pathway which leads to the biosynthesis of aromatic compounds including amino acids and 
vitamins. Specifically, glyphosate inhibits the conversion of phosphoenolpyruvic acid and 3- 
phosphoshikimicacid to 5-enoIpyn.vyl-3-phosphoshikimicacid by inhibiting the enzyme 5-enoIpyn.vyl- 
3-phosphoshikimic acid synthase (EPSP synthase or EPSPS). Supplemental EPSPS, conferred via 
insertion of a transgene encoding this enzyme, allows the cell to resist the effects of the glyphosate. Thus, 
as the herbicide glyphosate functions to kill the cell by interrupting aromatic amino acid biosynthesis' 
particularly in the cell's chloroplast, the PTP allows increased resistance to the herbicide by concentrating 
what glyphosate resistance enzyme the cell expresses in the chloroplast, Le. in the target organelle of the 
cell. Exemplary herbicide resistance enzymes include ESPS as noted above, glyphosate oxido-reductase 
(GOX) and the aroA gene (see U.S. Patem No. 4,535,060, specifically incorporated herein by reference in 
its entirety). 

PTPs can target proteins to chloroplasts and other plastids. For example, the target organelle may 
be the amyloplast. Preferred PTPs of the present invention include those targeting both chloroplasts as 
well as other plastids. Specific examples of preferred PTPs include the maize RUBISCO SSU protein 
PTP, and functionally related peptides such as PTPI, PTPA, and PTP2. These PTPs are exemplified by 
the polypeptides shown in SEQ ID N0:4, SEQ ID N0:6, SEQ ID NO:8, and SEQ ID NO:]0. 
Polynucleotide sequences encoding for these polypeptides are shown in SEQ ID N0:3, SEQ ID NO:5 
SEQ ID N0:7, and SEQ ID N0:9. 

Recombinant plants, cells, seeds, and other plant tissues could also be produced in which only the 
mitochondrial or chloroplast DNA has been altered to incorporate the molecules envisioned in this 
application. Promoters which function in chloroplasts have been known in the art (Hanley-Bowden et al.. 
Trends in Biochemical Sciences 12:67-70, 1 987). Methods and compositions for obtaining cells 
containing chloroplasts into which heterologous DNA has been inserted has been described by Daniell et 
al., U.S. Pat. No. 5,693,507 (1997). McBride et al. (WO 95/24492) disclose localization and expression 
of genes encoding Cry 1 A 5-endotoxin protein in tobacco plant chloroplast genonies. As disclosed herein, 
localization of Co'2Aa to the chloroplast or plastid results in decreased levels of expression as measured 
by accumulation of Cry2Aa 5-endotoxin, which is in contrast to the improved expression of chloroplast or 
plastid localized Cry2Ab 5-endotoxin. 
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4.52 Use ofPromotere in Expression Vectors 

The expression of a gene which exists in double-stranded DNA form involves transcription of 
messenger RNA (mRNA) from the coding strand of the DNA by an RNA polymerase enzyme, and the 
subsequent processing of the mRNA primary transcript inside the nucleus. Transcription of DNA into 
mRNA is regulated by a region of DNA usually refened to as the "promoter". The promoter region 
contains a sequence of bases that signals RNA polymerase to associate with the DNA and to initiate the 
transcription of mRNA using one of the DNA strands as a template to make a corresponding strand of 
RNA. The particular promoter selected should be capable of causing sufficient expression of the enzyme 
coding sequence to result in the production of an effective insecticidal amount of the B. thurmgiensis 
protein. 

The 3' non-translated region of the chimeric plant genes of the present invention also contains a 
polyadenylation signal which functions in plants to cause the addition of adenylate nucleotides to the 3' 
end of the RNA. Examples of preferred 3' regions are (1) the 3' transcribed, non-translated regions 
containing the polyadenylation signal of Agrobacteriun, tumor-inducing (Ti) plasmid genes, such as the 
nopaline synthase (NOS) gene and (2) the 3' ends of plant genes such as the pea ssRUBISCO E9 gene 
(Fischhoffe/o/., 1987). 

A promoter is selected for its ability to direct the transformed plant cell's or transgenic plant's 
transcriptional activity to the coding region, to ensure sufficient expression of the enzyme coding 
sequence to result in the production of insecticidal amounts of the B. thuringiensis protein. Structural 
genes can be driven by a variety of promoters in plant tissues. Promoters can be near-constitutive (/.e. 
they drive transcription of the transgene in all tissue), such as the CaMV35S promoter, or tissue-specific 
or developmentally specific promoters affecting dicots or monocots. Where the pix,moter is a near- 
constitutive promoter such as CaMV35S or FMV35S, increases in polypeptide expression are found in a 
variety of transformed plant tissues and most plant organs {e.g., callus, leaf, seed and root). Enhanced or 
duplicate vereions of the CaMV35S and FMV35S promoters are particularly useful in the practice of this 
mvention (Kay et al, 1987; Rogers, U. S. Patent 5,378,619). 

Those skilled in the art will recognize that there are a number of promoters which are active in 
plant cells, and have been described in the literature. Such promoters may be obtained from plants or 
plant viruses and include, but are not limited to, the nopaline synthase (NOS) and octopine synthase 
(OCS) promoters (which are carried on tumor-inducing plasmids of A. tumefaciens), the cauliflower 
mosaic virus (CaMV) 19S and 35S promoters, the light-inducible promoter from the small subunit of 
ribulose 1.5-bisphosphate carboxylase (ssRUBiSCO, a very abundam plant polypeptide), the rice Actl 
promoter and the Figwort Mosaic Virus (FMV) 35S promoter. All of these promoters have been used to 
create various types of DNA constructs which have been expressed in plants (see e.g., McElrov et al. 
1990. U.S. Patent 5,463.175). 
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In addition, it may also be preferred to bring about expression of the B. thuringiensis 6-endotoxin 
in specific tissues of the plant by using plant integrating vectors containing a tissue-specific promoter. 
Specific target tissues may include the leaf, stem, root, tuber, seed, fruit, etc., and the promoter chosen 
should have the desired tissue and developmental specificity. Therefore, promoter function should be 
optimized by selecting a promoter with the desired tissue expression capabilities and approximate 
promoter strength and selecting a transformant which produces the desired insecticidal activity in the 
target tissues. This selection approach from the pool of transformants is routinely employed in expression 
of heterologous structural genes in plants since there is variation between transformants containing the 
same heterologous gene due to the site of gene insertion within the plant genome (commonly referred to 
as "position effect"). In addition to promoters which are known to cause transcription (constitutive or 
tissue-specific) of DNA in plant cells, other promoters may be identified for use in the current invention 
by screening a plant cDNA library for genes which are selectively or preferably expressed in the tai^et 
tissues and then determine the promoter regions. 

An exemplary tissue-specific promoter is the lectin promoter, which is specific for seed tissue. 
The lectin protein in soybean seeds is encoded by a single gene {Lei) that is only expressed during seed 
maturation and accounts for about 2 to about 5% of total seed mRNA. The lectin gene and seed-specific 
promoter have been fully characterized and used to direct seed specific expression in transgenic tobacco 
plants (Vodkin et al, 1983; Lindstrom et al, 1990). An expression vector containing a coding region that 
encodes a polypeptide of interest can be engineered to be under control of the lectin promoter and that 
vector may be introduced into plants using, for example, a protoplast transformation method (Dhir et al, 
1991). The expression of the polypeptide would then be directed specifically to the seeds of the 
transgenic plant. 

A transgenic plant of the present invention produced from a plant cell transformed with a tissue 
specific promoter can be crossed with a second transgenic plant developed from a plant cell transformed 
with a different tissue specific promoter to produce a hybrid transgenic plant that shows the effects of 
transformation in more than one specific tissue. 

Other exemplaiy tissue-specific promoters are com sucrose synthetase 1 (Yange/«/., 1990), com 
alcohol dehydrogenase I (Vogel et al., 1989), com light harvesting complex (Simpson, 1986), com heat 
shock protein (Odell et al., 1985), pea small subunit RuBP carboxylase (Poulsen et al., 1 986; Cashmore et 
al., 1983), Ti plasmid mannopine synthase (McBride and Summerfelt, 1989), Ti plasmid nopaline 
synthase (Langridge et al, 1989). petunia chalcone isomerase (Van Tunen et al, 1 988). bean glycine rich 
protein 1 (Keller e/ a/., 1 989), CalVfV 35s transcript (Odell e^/., 1985) and Potato patatin(Wenzlere/o/., 
1989). Preferred promoters are the cauliflower mosaic virus (CaMV 35S) promoter and the S-E9 small 
subunit RuBP carboxylase promoter. 

The promoters used in the DNA constmcts of the present invention may be modified, if desired, 
to affect their control characteristics. For example, the CaMV35S promoter may be ligated to the portion 
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of the ssRUBISCO gene that represses the expression of ssRUBISCO in the absence of light, to create a 
promoter which is active in leaves but not in roots. The resulting chimeric promoter may be used as 
described herein. For purposes of this description, the phrase "CaiVlV35S" promoter thus includes 

variationsofCaMV35Spromoter,e.g., promoters derived by means of ligation with operatorregions, 
random or controlled mutagenesis, etc. Furthennore. the promoters may be altered to contain multiple 
"enhancer sequences" to assist in elevating gene expression. Examples of such enhancer sequences have 
been reported by Kay et al. (mi). Chloroplast or plastid specific promoter, are known in the art 
(Daniell et al., US Pat. No. 5,693,507; herein incorporated by reference), for example promoters 
obtainable from chloroplast genes, such as th^psbA gene from spinach or pea, the rbcL and atpB 
promoter region from maize, and rRNA promoters. Any chloroplast or plastid operable promoter is 
within the scope of the present invention. 

The RNA produced by a DNA construct of the present invention also contains a 5' non-translated 
leader sequence. This sequence can be derived from the promoter selected to express the gene, and can 
be specifically modified so as to increase translation of the mRNA. The 5' non-translated regions can 
also be obtained from viral RNAs. from suitable eukaryotic genes, or from a synthetic gene sequence. 
The present invention is not limited to constructs wherein the non-translated region is derived from the 5' 
non-translated sequence that accompanies the promoter sequence. As shown below, a plant gene leader 
sequence which is useful in the presem invention is the petunia heat shock protein 70 (hsp70) leader 
(Winter e/fl/., 1988). 

An exemplary embodiment of the invention involves the plastid targeting or plastid localization 
of the B. thwingiemis amino acid sequence. Plastid targeting sequences have been isolated from 
numerous nuclear encoded plant genes and have been shown to direct importation of cytoplasmically 
synthesized proteins into plastids (reviewed in Keegstra and Olsen, 1989). A variety of plastid targeting 
sequences, well known in the art, including but not limited to ADPGPP, EPSP synthase, or ssRUBISCO, 
may be utilized in practicing this invention. In alternative embodiments preferred, plastidic targeting 
sequences (peptide and nucleic acid) for monocotyledonous crops may consist of a genomic coding 
fragment containing an intron sequence as well as a duplicated proteolytic cleavage site in the encoded 
plastidic targeting sequences. 

The most preferred nucleic acid sequence, referred to herein as zmSSU PTP (SEQ ID N0:3), 
consists of a genomic coding fragment containing an intron sequence as well as a duplicated proteolytic 
cleavage site in the encoded plastidic targeting sequences, was derived from plastid targeting sequence 
zmSl (Russell et al., 1993). Direct translational fusions of zmSSU PTP peptide sequence (SEQ ID N0:4) 
to the amino terminus of the sequences are useful in obtaining elevated levels of the polypeptide in 
transgenic maize. In-frame fusions of the zmSSU PTP nucleic acid sequence (SEQ ID N0:3) to the 
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crylAb gene (SEQ ID NO:l) can be effected by ligation of the Ncol site at the 3' (C-tenninal encoding) 
end of the zmSSU FTP sequence with the 5' Ncol site (N-tem,inal encoding) of the crylAh sequence. 

The preferred sequence for dicotyledonous crops referred to herein as PTP2 (SEQ ID NO-9) 
consists of a genomic coding fragment containing the chloroplast targeting peptide sequence from the' 
EPSP synthase gene of Arabidopsis thalicma in which the transit peptide cleavage site of the pea 
ssRUBISCO PTP replaces the native EPSP synthase PTP cleavage site (Klee et al, 1 987). 

As noted above, the 3' non-translated region of the chimeric plant genes of the present invention 
contains a polyadenylation signal which functions in plants to cause the addition of adenylate nucleotides 
to the 3' end of the RNA. Examples of preferred 3' regions are (1) the 3' transcribed, non-translated 
legions containing the polyadenylate signal of Agrobacteriun, tumor-inducing (Ti) plasmid genes, such as 
the nopalme synthase (NOS) gene and (2) plant genes such as the pea ssRUBISCO E9 gene (Fischhoff 
al, 1987). 

4.5.3 Use of Introns in Expression Vectors 

For optimized expression in monocotyledonous plants, an intron may also be included in the 
DNA expression construct. Such an intron is typically placed near the 5'-e„d of the mRNA in 
untranslated sequence. This intron could be obtained from, but not limited to. a set of introns consisting 
of the maize Heat Shock Protein (HSP) 70 intron (U. S. Patent 5,424,412; 1995). the rice Actl inlron 
(McElroy et al., 1990). the Adh intron 1 (Callis e/ ,987). or the sucrose synthase intron (Vasil al 
1989). As shown herein, the maize HSP70 intron is useful in the present invention. 

4.5.4 USE OF TERMINATORS IN EXPRESSION VECTORS 

RNA polymerase transcribes a nuclear genome coding DNA sequence through a site where 
polyadenylation occurs. Typically, DNA sequences located a few hundred base pairs downstream of the 
polyadenylation site sen.e to terminate transcription. Those DNA sequences are referred to herein as 
transcnption-termination regions. Those regions are required for efficient polyadenylation of transcribed 
messenger RNA (mRMA). For coding sequences introduced into a chloroplast or plastid, or into a 
chloroplast or plastid genome. mRNA transcription termination is similar to methods well known in the 
bacterial gene expression art. For example, either in a polycistronic or a monocistronic sequence 
transcription can be tenninated by stem and loop structures or structures similar to rho dependem 
sequences. 

Constructs will typically include the gene of interest along with a 3' end DNA sequence that acts 
as a signal to terminate transcription and. in constructs intended for nuclear genome expression, allow for 
the polyadenylation of the resultant mRNA. The most preferred 3' elements are contemplated to be those 
from the nopaline synthase gene of ^. tume/aciens („os 3'end) (Bevan er al., ,983). the terminator for the 
T7 transcript from the octopine synthase gene OF^. tume/aciens, and the 3' end of the protease inhibitor i 
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or ii genes from potato or tomato. Regulatory elements such asTMVQ element (Galiie, et al, 1989), 

may further be included where desired. 

4.5.5 Other Expression-Enhancing Elements 

Another type of element which can regulate gene expression is the DNA sequence between the 
transcription initiation site and the start of the coding sequence, tenned the untranslated leader sequence 
The leader sequence can influence gene expression. Compilations of leader sequences have been made to 
predict optimum or sub-optimum sequences and generate "consensus" and preferred leader sequences 
(Joshi, 1987). Preferred leader sequences are contemplated to include those which comprise sequences 
predicted to direct optimum expression of the linked structural gene, i.e. to include a preferred consensus 
leader sequence which may increase or maintain mRNA stability and prevent inappropriate initiation of 
translation. The choice of such sequences will be known to those of skill in the art in light of the present 
disclosure. Sequences that are derived from genes that are highly expressed in plants, and in maize in 
particular, will be most preferred. One particularly useful leader may be the petunia HSP70 leader. 

Transcription enhancers or duplications of enhanceis could be used to increase expression. These 
enhancers often are found 5' to the start of transcription in a promoter that functions in eukaryotic cells, 
but can often be inserted in the forward or reverse orientation 5' or 3' to the coding sequence. Examples 
of enhancers include elements from the CaMV 35S promoter, octopine synthase genes (Ellis et al., 1987), 
the rice actin gene, and promoter from non-plant eukaryotes {e.g.. yeast; Ma et al, 1988). 
4.5.6 Multigene Vector Constructs and IRES 

In certain embodiments of the invention, the use of internal ribosome binding sites (IRES) 
elements are used to create multigene, or polycistronic. messages. IRES elements are able to bypass the 
ribosome scanning model of 5' methylated Cap dependent translation and begin translation at internal 
sites (Pelletier and Sonenberg, 1988). IRES elements from two members of the picomavirus family 
(polio and encephalomyocarditis) have been described (Pelletier and Sonenberg, 1988), as well an IRES 
from a mammalian message (Macejak and Samow, 1991). IRES elements can be linked to heterologous 
open reading frames. Multiple open reading frames can be transcribed together, each separated by an 
IRES, creating polycistronic messages. By virtue of the IRES element, each open reading frame is 
accessible to ribosomes for efficient translation. Multiple genes can be efficiently expressed using a 
single promoter/enhancer to transcribe a single message. 

Any heterologous open reading frame can be linked to IRES elements. This includes genes for 
secreted preteins. multi-subunit proteins, encoded by independent genes, intracellular or membrane- 
bound proteins and selectable markers. In this way, expression of several proteins can be simultaneously 
engineered mto a cell with a single construct and a single selectable marker. 
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Constnicts intended for expression from within a chloroplast or plastid utilizing chloroplast or 
plastid specific transcriptional and translational machinery can contain either mono- or polycistronic 
sequences. 

43.7 Construction of the Expression Vector 

Tlie choice of which expression vector and ultimately to which promoter a polypeptide coding 
region is operatively linked depends directly on the functional properties desired, e.g., the location and 
timing of protein expression, and the host cell to be transformed. These are well known limitations 
inherent in the art of constructing recombinant DNA molecules. However, a vector useful in practicing 
the present invemion is capable of directing the expression of the polypeptide coding region to which it is 
operatively linked. 

Typical vectors useful for expression of genes in higher plants are well known in the art and 
include vectors derived from the tumor-inducing (Ti) plasmid of ^. tumefaciem described (Rogers et ai, 
1 987). However, several other plant integrating vector systems are known to function in plants including 
pCaMVCN transfer control vector described (Fromm et al., 1986). pCaMVCN (available from 
Pharmacia, Piscataway, NJ) includes the CaMV35S promoter. 

In preferred embodiments, the vector used to express the polypeptide includes a selection marker 
that is effective in a plant cell, preferably a drug resistance selection marker. One preferred drug 
resistance marker is the gene whose expression results in kanamycin resistance; i.e. the chimeric gene 
containing the nopaline synthase promoter, Tn5 neomycin phosphotransferase II (riptU) and nopaline 
synthase 3' non-translated region described (Rogers et ah, 1988). 

Means for preparing expression vectors are well known in the art. Expression (transfomation) 
vectors used to transform plants and methods of making those vectors are described in U. S. Patents 
4,971.908, 4,940,835, 4,769,061 and 4,757,011 (each of which is specifically incorporated herein by 
reference). Those vectors can be modified to include a coding sequence in accordance with the present 
invention. 

A variety of methods have been developed to operatively link DNA to vectors via complementary 
cohesive termini or blunt ends. For instance, complementary homopolymer tracts can be added to the 
DNA segment to be inserted and to the vector DNA. The vector and DNA segment are then joined by 
hydrogen bonding between the complementary homopolymeric tails to form recombinant DNA 
molecules. 

A coding region that encodes a polypeptide having the ability to confer insecticidal activity to a 
cell is preferably a polynucleotide encoding a B. thwingiemis 8-endotoxin or a functional equivalent of 
such a polynucleotide. In accordance with such embodiments, a coding region comprising the DNA 
sequence of SEQ ID NO: 1 is also preferred. 

Specific B. thuringiensis 5-endotoxin polypeptide-encoding genes that have been shown to 
successfully transform plants in conjunction with plastid targeting peptide-encoding genes, to express the 
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B. thuringiemis 5-endotoxins at high levels are those genes comprised within the plasmid vectors. 
Preferred plasmids containing plastid targeting sequences include pMON30464, pMON33827, 
PMON33828, pMON33829. These plasmids are encoded for by the sequences shown in SEQ ID NO 1 6 
SEQ ID N0:13, SEQ ID N0:14, SEQ ID N0:15. More preferably, plants may be successfully 
transformed with any vector containing expression cassettes comprising the nucleotide sequences of 
nucleotide 1781 to 5869 of SEQ ID N0:16, nucleotide 17 to 3182 of SEQ ID NO:13. nucleotide 17 to 
3092 of SEQ ID NO: 1 4 or nucleotide 1 7 to 3 1 55 of SEQ ID NO: 1 5. 

The work described herein has identified methods of potentiating m planta expression of B. 
thuringiensis 5-endotoxins, which confer resistance to insect pathogens when incoqjorated into the 
nuclear, plastid. or chloroplast genome of susceptible plants. U. S. Patent 5,500,365 (specifically 
incorporated herein by reference) describes a method for synthesizing plant genes to optimize the 
expression level of the protein for which the synthesized gene encodes. This method relates to the 
modification of the structural gene sequences of the exogenous transgene, to make them more "plant-like" 
and therefore more likely to be translated and expressed by the plant. A similar method for enhanced 
expression of transgenes, preferably in monocotyledonous plants, is disclosed in U. S. Patent 5,689,052 
(specifically incorporated herein by reference). Agronomic, horticultural, ornamental, and other 
economically or commercially useful plants can be made in accordance with the methods described 
herein, to express B. thuringiensis 5-endotoxins at levels high enough to confer resistance to insect 
pathogens. 

Such plants may co-express the B thuringiensis 6-endotoxin polypeptide along with other 
antifungal, antibacterial, or antiviral pathogenesis-related peptides, polypeptides, or proteins; insecticidal 
proteins; proteins confeiring herbicide resistance; and proteins involved in improving the quality of plant 
products or agronomic performance of plants. Simultaneous co-expression of multiple proteins in plants 
is advantageous in that it exploits more than one mode of action to control plant pathogenic damage. Tliis 
can minimize the possibility of developing resistant pathogen strains, broaden the scope of resistance, and 
potentially result in a synergistic insecticidal effect, thereby enhancing plants ability to resist insect 
infestation (WO 92/1 7591). 

Specifically contemplated for use in accordance with the present invention are vectors which 
include the ocs enhancer element. This element was first identified as a 1 6 bp palindromic enhancer from 
the octopine synthase {ocs) gene of Agrobacterium (Ellis et al, 1987), and is present in at least 10 other 
promoters (Bouchez et al., 1989). It is proposed that the use of an enhancer element, such as the ocs 
element and particularly multiple copies of the element, may be used to increase the level of transcription 
from adjacent promoters when applied in the context of monocot transformation. 

It is contemplated that introduction of large DNA sequences comprising more than one gene may 
be desirable. Introduction of such sequences may be facilitated by use of bacterial or yeast artificial 
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chromosomes (BACs or YACs, respectively), or even plant artificial ch«>mosomes. For example, the use 
of BACs forAgrobacterium-medi&tod transformation was disclosed by Hamilton et al. (1996). 

Ultimately, the most desirable DNA segments for introduction into a monocot genome may be 
homologous genes or gene families which encode a desired trait (for example, increased yield), and which 
are introduced under the control of novel promoters or enhance.., e/c, or perhaps even homologous or 
tissue specific (e.g.. root-collar/sheath-. whorl-, stalk-, earshank-, kernel- or leaf-specific) promoters or 
control elements. Indeed, it is envisioned that a particular use of the present invention may be the 
production of transformants comprising a transgene which is targeted in a tissue-specific manner. For 
example, insect resistant genes may be expressed specifically in the whorl and collar/sheath tissues which 
are targets for the first and second broods, respectively, of ECB. Likewise, genes encoding proteins with 
particular activity against rootworm may be targeted directly to root tissues. 

Vectors for use in tissue-specific targeting of gene expression in transgenic plants typically will 
include tissue-specific promoters and also may include other tissue-specific control elements such as 
enhancer sequences. Promoters which direct specific or enhanced expression in certain plant tissues will 
be known to those of skill in the art in light of the present disclosure. 

It also is contemplated that tissue specific expression may be functionally accomplished by 
introducing a constitutively expressed gene (all tissues) in combination with an antisense gene that is 
expressed only in those tissues where the gene product is not desired. For example, a gene coding for the 
co'stal toxin protein from B. thuringiensis may be introduced such that it is expressed in all tissues using 
the 35S promoter from Cauliflower Mosaic Virus. Alternatively, a rice actin promoter or a histone 
promoter from a dicot or monocot species also could be used for constitutive expression of a gene. 
Furthermore, it is contemplated that promoters combining elements from more than one promoter may be 
useful. For example. U. S. Patent 5,491,288 discloses combining a Cauliflower Mosaic Virus promoter 
with a histone promoter. Therefore, expression of an antisense transcript of the Bt gene in a maize kernel, 
using for example a zein promoter, would prevent accumulation of the 8-endotoxin in seed. Hence the 
protein encoded by the introduced gene would be present in all tissues except the kernel. It is specifically 
contemplated by the inventors that a similar strategy could be used with the instant invention to direct 
expression of a screenable or selectable marker in seed tissue. 

Alternatively, one may wish to obtain novel tissue-specific promoter sequences for use in 
accordance with the present invention. To achieve this, one may first isolate cDNA clones from the tissue 
concerned and identify those clones which are expressed specifically in that tissue, for example, using 
Northern blotting. Ideally, one would like to identify a gene that is not present in a high copy number, but 
which gene product is relatively abundant in specific tissues. The promoter and control elements of 
corresponding genomic clones may this be localized using the techniques of molecular biology known to 
those of skill in the art. 
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It is contemplated that expression of some genes in transgenic plants will be desired only under 
specified conditions. For example, it is proposed that expression of certain genes that confer resistance to 
environmentally stress factors such as drought will be desired only under actual stress conditions. It 
further is contemplated that expression of such genes throughout a plants development may have 
detrimental effects. It is known that a large number of genes exist that respond to the environment. For 
example, expression of some genes such as rbcS, encoding the small subunit of ribulose bisphosphate 
carboxylase, is regulated by light as mediated through phytochrome. Other genes are induced by 
secondary stimuli. For example, synthesis of abscisic acid (ABA) is induced by certain environmental 
factors, including but not limited to water stress. A number of genes have been shown to be induced by 
ABA (Skriver and Mundy. 1990). It also is expected that expression of genes conferring resistance to 
insect predation would be desired only under conditions of actual insect infestation. ITierefore. for some 
desired traits, inducible expression of genes in transgenic plants will be desired. 

It is proposed that, in some embodiments of the present invention, expression of a gene in a 
transgenic plant will be desired only in a certain time period during the development of the plant. 
Developmental timing frequently is correlated with tissue specific gene expression. For example 
expression of zein storage proteins is initiated in the endosperm about 15 days alter pollination. 

It also is contemplated that it may be useful to target DNA itself with a cell. For example, it may 
be useful to target introduced DNA to the nucleus as this may increase the frequency of transfomjation. 
Within the nucleus itself it would be useful to target a gene in order to achieve site specific integration. 
For example, it would be useful to have a gene introduced through transformation replace an existing 
gene in the cell. 

4.6 Identification and Isolation of Insecticidal B. thuringiensis 6-Endotoxins and Genes 

It is contemplated that the method described in this invention could be used to obtain 
substantially improved expression of a number of novel B. thuringiensis endotoxins isolated as described 
below. Identification of new Bacillus thuringimsis strains encoding crystalline endotoxins with 
insecticidal activity has been described previously (Donovan et al., 1992). Isolation of the R 
thuringiensis endotoxin, followed by amino terminal amino acid sequencing, back-translation of the 
amino acid sequence to design an oligonucleotide probe or use of a related B. thuringiensis gene as a 
probe, followed by cloning of the gene encoding the endotoxin by hybridization are familiar to those 
skilled in the art and have been described, (see e.g., Donovan et al., 1992); U. S. Patent 5,264,364, each 
specifically incorporated herein by reference. 

Improved expression of Dipteran-inactive Cry2A B. thuringiensis 8-endotoxins in transgenic 
plants can be achieved via the methods described in this invention. One protein for which improved 
expression is obtained is Ciy2Ab. 

Previous work indicated that certain Cry2A 5-endotoxins were capable of wider host range 
specificity than other closely related Cfy2A 5-endotoxins wherein not only Lepidopteran species, but 



wo 00/26371 

PCTA;S99/26086 

-32- 

Dipteran species also were particularly susceptible to very low toxin doses. In contrast, the closely 
related Cry2A endotoxins not displaying substantial Dipteran inhibitoo' activity were thus shown to be 
more narrow in their host range specificity (Widner et al., 1 989, J. Bacteriol. 1 71 :965-974; Widner et al 
(a). 1990, J. Bacteriol. 172:2826-2832). These works indicated that Cry2Ab as used hei^in does not 
totally lack Dipteran inhibitory activity, but is simply much less potent than other closely related Cry2A 
B. thuringiensis 8-endotoxins. Those works indicated that Cry2Ab in particular was much less effective 
than Cry2Aa. and hence lacked Dipteran activity when tested against Aedes egyptii. TTiere is no one 
single acceptable means for distinguishing between closely related 8-endotoxins, however, as indicated 
herein, selection of an appropriate Ciy2A could be accomplished by using one or a combination of 
several methods including but not limited to comparisons in overall amino acid sequence homology, 
narrowly focused similarity comparisons between Cry2A's in the region specified by amino acid 
sequence 307-382, or based on IC50 data. Widner et al. demonstrated 50-100 times mor^ Cry2Ab than 
Cry2Aa was required to obtain a similar 1C50 effect on a Dipteran species. Thus, the range of 
susceptibility of a Dipteran species toward a Cry2A protein could be used as one means of measuring and 
distinguishing target insect susceptibility differences between different classes of Cry2A proteins For 
example, an IC50 PPM value of about 3-fold greater than that exhibited by Ciy2Aa against^erfe. egypHi 
could be utilized as a feature for excluding certain Cry2A proteins as lacking substantial Dipteran species 
mhibitoo^ activity. However, utilizing an approach based on IC50 inhibitory activity ranges should be 
used with caution, as these values are veo' dependent upon a number of highly variable conditions 
including but not limited to the methods and materials used for assaying the proteins and the physical 
conditioning of the insects assayed. An alternative means for distinguishing Cry2A 5-endotoxins lacking 
substantial Dipteran species inhibitory activity from S-endotoxins which are not within the scope of the 
pr^ent invention could encompass excluding Cry2A proteins which are greater than about 87% similar in 
amino acid sequence to C,y2Aa, or more preferentially excluding Cry2A proteins which are greater than 
about 90% similar in amino acid sequence to Cry2Aa. In particular, the region of C.y2Aa corresponding 
to amino acid residues from about 307 to about 382 are believed to be critical for the Dipteran inhibitoiy 
activity of the protein, and when substituted for the complementary region of dissimilarity in Cry2Ab 
confers Dipteran inhibitoiy activity to Cry2Ab protein. Thus, an additional means for distinguishing 
Cry2A 8-endotoxins which are within the scope of the present invemion could encompass a similarity 
comparison of this region of the protein, taking into consideration the level of homology to be avoided 
when comparing any particular Ciy2A 8-endotoxins to this region in Cry2Aa. The variable amino acids 
within this 76 amino acid sequence domain, Cry2A 6-endotoxins which are intended to be within the 
scope of the present invention would preferably be those which are more than from about 80 to about 99 
percent similar to Co.2Aa widiin this sequence, or more preferably those which are more than from about 
60 to about 79 percent similar to Cry2Aa within this sequence, or those which are more than from about 
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40 to about 59 pe««„t similar to Cn.2Aa within this sequence, or even more preferably those which are 
more than from about 24 to about 39 percent similar to Ciy2Aa within this sequence, or most preferably 
those Ciy2A 5-endotoxins which are more than from about 0 to about 23 percent similar to Cry2Aa 
within this sequence. 

4.7 Transformed Plant Cells and Transgenic Plants 

A plant transformed with an expression vector of the present invention is also contemplated. A 
transgenic plant derived from such a transformed or transgenic cell is also contemplated. Those skilled in 
the art will recognize Aat a chimeric plant gene containing a structural coding sequence of the present 
mvention can be inserted into the genome of a plant by methods well known in the art. Such methods for 
DNA transfonnation of plant cells include Agrobacterium.medht,d plant transformation, the use of 
liposomes, transformation using viruses or pollen, electropomtion. p.x>toplast transformation, gene 
transfer mto pollen, injection into reprx)ductive organs, injection into immature embryos and particle 
bombardment. Each of these methods has distinct advantages and disadvantages. TT,us, one particular 
method of introducing genes into a particular plant strain may not necessarily be the most effective for 
another plant strain, but it is well known which methods are useful for a particular plant strain. 

There are many methods for introducing transforming DNA segments into cells, but not all are 
suitable for delivering DNA to plant ceils. Suitable methods are believed to include virtually any method 
by wh.ch DNA can be introduced into a cell, such as infection by A. tumefaciens and related 
Agrobacterium strains, direct delive-y of DNA such as, for example, by PEG-mediated transfonnation of 
protoplasts (Omirulleh et al, 1993), by desiccation/inhibition-mediatedDNA uptake, by electroporation 
by agitation with silicon carbide fibers, by acceleration of DNA coated particles, etc. In certain 
embodiments, acceleration methods are preferred and include, for example, microprojectile bombardment 
and the like. 

Technology for introduction of DNA into cells is well-known to those of skill in the art. Four 
geneml methods for delivering a gene into cells have been described: (1) chemical methods (Graham and 
van der Eb. 1973); (2) physical methods such as microinjection (Capecchi, 1980), electroporation (Wong 
and Neumamr. 1982; Fromm etaU 1985) and the gene gun (Johnston and Tang, 1994- Fynan eial 
1993); (3) viral vectors (Clapp, 1993; Lu etal, 1993; Eglitis and Anderson, 1988a; 1988b); and (4^ 
receptor-mediated mechanisms (Curiele/ a/., 1991; 1992; ^z^^^xetal, 1992). 
4.7.1 Electroporation 

The application of brief, high-voltage electric pulses to a variety of animal and plant cells leads to 
the formation of nanometer-sized pores in the plasma membrane. DNA is taken directly into the cell 
cytoplasm either through these pores or as a consequence of the redistribution of membrane components 
that accompanies closure of the pores. Electroporation can be extremely efficient and can be used both 
for transient expression of cloned genes and for establishment of cell lines that carry integrated copies of 
the gene of interest. Electroporation, in contrast to calcium phosphate-mediated transfection and 
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protoplast fusion, frequently gives rise to cell lines that carry one. or at most a few, integrated copies of 
the foreign DNA. 

The introduction of DNA by means of electroporation is well-known to those of skill in the art. 
To effect transformation by electroporation, one may employ either friable tissues such as a suspension 
culture of cells, or embiyogenic callus, or alternatively, one may transfom, immature embryos or other 
organized tissues directly. One would partially degrade the cell walls of the chosen cells by exposing 
them to pectin-degrading enzymes (pectolyases) or mechanically wounding in a controlled manner, 
rendering the cells more susceptible to transformation. Such cells would then be recipient to DNA 
transfer by electroporation. which may be carried out at this stage, and transformed cells then identified 
by a suitable selection or screening protocol dependent on the nature of the newly incorporated DNA. 
4.7.2 Microprojectile Bombardment 

A further advantageous method for delivering transforming DNA segments to plant cells is 
microprojectile bombardment. In this method, particles may be coated with nucleic acids and delivered 
into ceils by a propelling force. Exemplary particles include those comprised of tungsten, gold, platinum, 
and the like. Using these particles, DNA is carried through the cell wall and into the cytoplasm on the 
surface of small metal particles as described (Klein et al, 1987; Klein etal, 1988; Kawata et al, 1988). 
The metal particles penetrate through several layers of cells and thus allow the transformation of cells 
within tissue explants. The microprojectile bombardment method is preferred for the identification of 
chloroplast or plastid directed transformation events. 

An advantage of microprojectile bombardment, in addition to it being an effective means of 
reproducibly stably transforming plant cells, is that neither the isolation of protoplasts (Cristou et al., 
1988) nor the susceptibility to Agrobacterium infection is required. An illustrative embodiment of a 
method for delivering DNA into plant cells by acceleration is a Biolistics Particle Delivery System, which 
can be used to propel particles coated with DNA or cells through a screen, such as a stainless steel or 
Nytex screen, onto a filter surface covered with the plant cultured cells in suspension. The screen 
disperses the particles so that they are not delivered to the recipient cells in large aggregates. It is 
believed that a screen intervening between the projectile apparatus and the cells to be bombarded reduces 
the size of projectiles aggregate and may contribute to a higher frequency of transformation by reducing 
damage inflicted on the recipient cells by projectiles that are too large. 

For the bombardment, cells in suspension are preferably concentrated on filters or solid culture 
medium. Alternatively, immature embryos or other target cells may be arranged on solid culture medium. 
The cells to be bombarded are positioned at an appropriate distance below the microprojectile stopping 
plate. If desired, one or more screens are also positioned between the acceleration device and the cells to 
be bombarded. Through the use of techniques set forth herein one may obtain up to 1000 or more foci of 
cells transiently expressing a marker gene. The number of cells in a focus which express the exogenous 
gene product 48 hours post-bombardment often range fi-om 1 to 1 0 and average I to 3. 
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In bombardment transformation, one may optimize the piebombardment culturing conditions and 
the bombardment parameters to yield the maximum numbers of stable transformants. Both the physical 
and biological parameters for bombardment ai« important in this technology. Physical factors are those 
that involve manipulating the DNA/microprojectile precipitate or those that affect the flight and velocity 
of either the macro- or microprojectiles. Biological factors include all steps involved in manipulation of 
cells before and immediately after bombardment, the osmotic adjustment of target cells to help alleviate 
the trauma associated with bombardment, and also the nature of the transforming DNA. such as linearized 
DNA or intact supercoiled plasmids. It is believed that pre-bombardment manipulations are especially 
important for successful transformation of immature plant embryos. 

Accordingly, it is contemplated that one may desire to adjust various of the bombardment 
parameters in small scale studies to fully optimize the conditions. One may particularly wish to adjust 
physical parameters such as gap distance, flight distance, tissue distance, and helium pressure. One may 
also minimize the trauma reduction factors (TRFs) by modifying conditions which influence the 
physiological state of the recipient cells and which may therefore influence transformation and integration 
efficiencies. For example, the osmotic state, tissue hydration and the subculture stage or cell cycle of the 
recipient cells may be adjusted for optimum transformation. The execution of other routine adjustments 
will be known to those of skill in the art in light of the present disclosure. 

The methods of particle-mediated transformation is well-known to those of skill in the art. U. S. 
Patent 5,015,580 (specifically incorporated herein by reference) describes the transformation of soybeans 
using such a technique. 
4.7.3 Agrobacterium-Mediated Transfer 

Agrobactemm-mediiited transfer is a widely applicable system for introducing genes into plant 
cells because the DNA can be introduced into whole plam tissues, thereby bypassing the need for 
regeneration of an intact plant from a protoplast. The use of Agrobacterium-m^iatcd plant integrating 
vectors to introduce DNA into plant cells is well known in the art. See, for example, the methods 
described (Fraley e, al., 1985; Rogers et al, 1987). The genetic engineering of cotton plants using 
^groWrmm-mediated transfer is described in U. S. Patem 5,004,863 (specifically incorporated herein 
by reference); like transformation of lettuce plants is described in U. S. Patent 5,349,124 (specifically 
incorporated herein by reference); and die Agrobacterium-mtdi^X^ transformation of soybean is 
described in U. S. Patent 5.416,011 (specifically incorporated herein by reference). Further, the 
integration of the Ti-DNA is a relatively precise process resulting in few rearrangements. The region of 
DNA to be transferred is defined by the border sequences, and intervening DNA is usually inserted into 
the plant genome as described (Spielmann et al, 1986; Jorgensen et al, 1987). 

Modem Agrobacterivm transformation vectors are capable of replication in E. coli as well as 
Agrobacterium, allowing for convenient manipulations as described (Klee et al, 1985). Moreover, recent 
technological advances in vectors for Agrobacterium-m^dr^xcd gene transfer have improved the 
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arrangement of genes and restriction sites in the vectors to facilitate construction of vectors capable of 
expressing various polypeptide coding genes. The vectors described (Rogers et al, 1987), have 
convenient multi-linker regions flanked by a promoter and a polyadenylation site for direct expression of 
inserted polypeptide coding genes and are suitable for present purposes. In addition, Agrobacterium 
containing both armed and disarmed Ti genes can be used for the transfomations. In those plant varieties 
where Agrobacterium-m^istQA transfoimation is efficient, it is the method of choice because of the facile 
and deflned nature of the gene transfer. 

Agrobacterium-mtdx^X^A transformation of leaf disks and other tissues such as cotyledons and 
hypocotyls appears to be limited to plants that Agrobactehvm naturally infects. Agrobacteriwn-m^mQd 
transformation is most efficient in dicotyledonous plants. Few monocots appear to be natural hosts for 
Agrobacterium, although transgenic plants have been produced in asparagus nsing Agrobacterium vectore 
as described (Bytebier et al., 1987). Other monocots recently have also been transformed with 
Agrobacterium. Included in this group are corn (Ishida et al.) and rice (Cheng et al.). 

A transgenic plant formed using Agrobacterium transformation methods typically contains a 
single gene on one chromosome. Such transgenic plants can be referred to as being heterozygous for the 
added gene. However, inasmuch as use of the word "heterozygous" usually implies the presence of a 
complementary gene at the same locus of the second chromosome of a pair of chromosomes, and there is 
no such gene in a plant containing one added gene as here, it is believed that a more accurate name for 
such a plant is an independem segregant. because the added, exogenous gene segregates independently 
during mitosis and meiosis. 

An independent segregant may be preferred when the plant is commercialized as a hybrid, such as 
com. In this case, an independent segregant containing the gene is crossed with another plant, to form a 
hybrid plant that is heterozygous for the gene of interest. 

An alternate preference is for a transgenic plant that is homozygous for the added structural gene; 
i.e. a transgenic plant that contains two added genes, one gene at the same locus on each chromosome of a 
chromosome pair. A homozygous transgenic plant can be obtained by sexually mating (selfing) an 
independent segregam transgenic plant that contains a single added gene, germinating some of the seed 
produced and analyzing the resulting plants produced for gene of interest activity and mendelian 
inheritance indicating homozygosity relative to a control (native, non-transgenic) or an independent 
segregant transgenic plant. 

Two different transgenic plants can be mated to produce offspring that contain two independently 
segregating added, exogenous genes. Selfing of appropriate progeny can produce plants that are 
homozygous for both added, exogenous genes that encode a polypeptide of interest. Back-crossing to a 
parental plant and out-crossing with a non-transgenic plant are also contemplated. 
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Transfonnation of plant protoplasts can be achieved using methods based on calcium phosphate 
precipitation, polyethylene glycol treatment, electroporation. and combinations of these treatments (see 
e.g.. Potiykus ei al, 1985; Lorz et al, 1985; Fromm et al, 1985; Uchimiya et ai, 1986; Callis et al, 
1 987; Marcotte et al, 1 988). 

Application of these systems to different plant germplasm depends upon the ability to regenerate 
that particular plant variety from protoplasts. Illustrative methods for the regeneration of cereals ftx)m 
protoplasts are described (see, eg., Fujimura et al, 1985; Toriyama et al, 1986; Yamada et al, 1986; 
Abdullah g/ a/., 1986). 

To transform plant germplasm that cannot be successfully regenerated from protoplasts, other 
ways to introduce DNA into intact cells or tissues can be utilized. For example, regeneration of cereals 
from immature embryos or explants can be effected as described (Vasil, 1988). 
4.8 Gene Expression in Plants 

Unmodified bacterial genes are often poorly expressed in transgenic plant cells. Plant codon 
usage more closely resembles that of humans and other higher organisms than unicellular organisms, such 
as bacteria. Several reports have disclosed methods for improving expression of recombinam genes in 
plants ( Murray er«/., 1989;Diehn et al, 1996; lannacone e/a/., 1997; Rouwendal el a/., 1997; Futterer 
et al, 1 997; and Futterer and Hohn, 1 996). These reports disclose various methods for engineering coding 
sequences to represent sequences which are more efficiently translated based on plant codon frequency 
tables, improvements in codon third base position bias, using recombinant sequences which avoid suspect 
polyadenylation or AA" rich domains or intron splicing consensus sequences. While these methods for 
synthetic gene construction are notable, synthetic genes of the present invention were prepared according 
to the method of Brown et al. (US Pat. No. 5,689,052; 1997), which is herein incorporated in its entirety 
by reference. Thus, the present invention provides a method for preparing synthetic plant genes express 
in planta a desired protein product at levels significantly higher than the wild-type genes. Briefly, 
according to Brown et al., the frequency of rare and semi-rare monocotyledonous codons in a 
polynucleotide sequence encoding a desired protein are reduced and replaced with more preferred 
monocotyledonous codons. Enhanced accumulation of a desired polypeptide encoded by a modified 
polynucleotide sequence in a monocotyledonous plant is the result of increasing the frequency of 
preferred codons by analyzing the coding sequence in successive six nucleotide fragments and altering the 
sequence based on the frequency of appearance of the six-mers as to the frequency of appearance of the 
rarest 284, 484, and 664 six-mers in monocotyledonous plants. Furthermore, Brown et al. disclose the 
enhanced expression of a recombinant gene by applying the method for reducing the frequency of rare 
codons with methods for reducing the occurrence of polyadenylation signals and intron splice sites in the 
nucleotide sequence, removing self-complementary sequences in the nucleotide sequence and replacing 
such sequences with nonself-complementaiy nucleotides while maintaining a structural gene encoding the 
polypeptide, and reducing the frequency of occurrence of 5'-CG-3' dinucleotide pairs in the nucleotide 
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sequence. These steps are performed sequentially and have a cumulative effect resulting in a nucleotide 
sequence containing a preferential utilization of the more-prefeired monocotyledonous codons for 
monocotyledonous plants for a majority of the amino acids present in the desired polypeptide. 

The work described herein has identified methods of potentiating in planta expression of B. 
thwingiensis 8-endotoxins. which confer resistance to insect pathogens when incorporated into the 
nuclear, plastid, or chloroplast genome of susceptible plants. U. S. Patent 5,500,365 (specifically 
incorporated herein by reference) describes a method for synthesizing plant genes to optimize the 
expression level of the protein for which the synthesized gene encodes. This method relates to the 
modification of the structural gene sequences of the exogenous transgene, to make them more "plant-like" 
and therefore more likely to be translated and expressed by the plant, monocot or dicot. However, the 
method as disclosed in U. S. Patem 5,689,052 provides for enhanced expression of transgenes, preferably 
in monocotyledonous plants. 

4.9 Production of Insect-Resistant Transgenic Plants 

Thus, the amount of a gene coding for a polypeptide of interest (i.e. a bacterial crystal protein or 
5-endotoxin polypeptide and a plastid targeting peptide) can be increased in plants by transforming those 
plants using transformation methods such as those disclosed herein at Section 4.7. In particular, 
chloroplast or plastid transformation can result in desired coding sequences being present in up to about 
10,000 copies per cell in tissues containing these subcellular organelle structures (McBride et al., 
Bio/Technology 13:362-365, 1995). 

DNA can also be introduced into plants by direct DNA transfer into pollen as described (Zhou et 
al., 1983; Hess, 1987). Expression of polypeptide coding genes can be obtained by injection of the DNA 
into reproductive organs of a plant as described (Pena et al., 1987). DNA can also be injected directly 
into the cells of immature embryos and the rehydration of desiccated embryos as described (Neuhaus et 
al, i987;BenbrookeM/., 1986). 
4.9.1 Selection of Transformed Cells 

After effecting delivery of exogenous DNA to recipient cells, the next step to obtain a transgenic 
plant generally concern identifying the transformed ceils for further culturing and plant regeneration. As 
mentioned herein, in order to improve the ability to identify transformants, one may desire to employ a 
selectable or screenable marker gene as, or in addition to, the expressible gene of interest. In this case, 
one would then generally assay the potentially transformed cell population by exposing the cells to a 
selective agent or agents, or one would screen the cells for the desired marker gene trait. 

An exemplary embodiment of methods for identifying transformed cells involves exposing the 
transformed cultures to a selective agent, such as a metabolic inhibitor, an antibiotic, herbicide or the like. 
Cells which have been transformed and have stably integrated a marker gene conferring resistance to the 
selective agent used, will grow and divide in culture. Sensitive cells will not be amenable to further 
culturing. One example of a preferred marker gene confers resistance to glyphosate. When this gene is 
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used as a selectable marker, the putatively transformed cell culture is treated with glyphosate. Upon 
treatment, transgenic cells will be available for further culturing while sensitive, or non-transformed cells, 
will not. This method is described in detail in U. S. Patent 5,569.834, which is specifically incorporated 
herein by reference. Another example of a preferred selectable marker system is the neomycin 
phosphotransferase (nptll) resistance system by which resistance to the antibiotic kanamycin is 
conferred, as described in U. S. Patent 5.569,834 (specifically incorporated herein by reference). Again, 
after transformation with this system, transformed cells will be available for further culturing upon 
treatment with kanamycin, while non-transformed cells will not. Yet another preferred selectable marker 
system involves the use of a gene construct conferring resistance to paromomycin. Use of this type of a 
selectable marker system is described in U. S. Patent 5,424,412 (specifically incorporated herein by 
reference). 

All contemplated assays are nondestructive and transformed cells may be cultured further 
following identification. Another screenable marker which may be used is the gene coding for green 
fluorescent protein. 

Transplastonomic selection (selection of plastid or chloroplast transformation events) is 
simplified by taking advantage of the sensitivity of chloroplasts or plastids to spectinomycin, an inhibitor 
of plastid or chloroplast protein synthesis, but not of protein synthesis by the nuclear genome encoded 
cytoplasmic ribosomes. Spectinomycin prevents the accumulation of chloroplast proteins required for 
photosynthesis and so spectinomycin resistant transformed plam cells may be distinguished on the basis 
of their difference in color: the resistant, transformed cells are green, whereas the sensitive cells are white, 
due to inhibition of plastid-protein synthesis. Transformation of chloroplasts or plastids with a suitable 
bacterial aad gene, or with a gene encoding a spectinomycin resistant plastid or chloroplast functional 
ribosomal RNA provides a means for selection and maintenance of transplastonomic events (Maliga, 
Trends in Biotechnology 1 1 : 1 0 1 - 1 06, 1 993). 

It is further contemplated that combinations of screenable and selectable markers will be useful 
for identification of transformed cells. In some cell or tissue types a selection agent, such as glyphosate 
or kanamycin, may either not provide enough killing activity to clearly recognize transformed cells or 
may cause substantial nonselective inhibition of transformants and nontransformants alike, thus causing 
the selection technique to not be effective. It is proposed that selection with a growth inhibiting 
compound, such as glyphosate at concentrations below those that cause 100% inhibition followed by 
screening of growing tissue for expression of a screenable marker gene such as kanamycin would allow 
one to recover transformants from cell or tissue types that are not amenable to selection alone. It is 
proposed that combinations of selection and screening may enable one to identify transformants in a 
wider variety of cell and tissue types. 
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4.9.2 Regeneration of Transfomiante 

The development or regeneration of plants from either single plant protoplasts or various explants 
is well known in the art (Weissbach and Weissbach, 1988). This regeneration and growth process 
typically includes the steps of selection of transfoiroed cells, culturing those individualized cells through 
the usual stages of embryonic development through the rooted plantlet stage. Transgenic embryos and 
seeds are similarly regenerated. The resulting transgenic rooted shoots are thereafter planted in an 
appropriate plant growth medium such as soil. 

The development or regeneration of plants containing the foreign, exogenous gene that encodes a 
polypeptide of interest introduced by Agrobacterium from leaf explants can be achieved by methods well 
known in the art such as described (Horsch et al, 1985). In this procedure, transformants are cultured in 
the presence of a selection agent and in a medium that induces the regeneration of shoots in the plant 
strain being transformed as described (Fraley et al, 1983). In particular, U. S. Patent 5,349,124 
(specification incorporated herein by reference) details the creation of genetically transformed lettuce 
cells and plants resulting therefrom which express hybrid crystal proteins conferring insecticidal activity 
against Lepidopteran larvae to such plants. 

This procedure typically produces shoots within two to four months and those shoots are then 
transferred to an appropriate root-inducing medium containing the selective agent and an antibiotic to 
prevent bacterial growth. Shoots that rooted in the presence of the selective agent to form plantlets ai« 
then transplanted to soil or other media to allow the production of roots. These procedures vary 
depending upon the particular plant strain employed, such variations being well known in the art. 

Preferably, the regenerated plants are self-pollinated to provide homozygous transgenic plants, or 
pollen obtained from the regenerated plants is crossed to seed-grown plants of agronomically important, 
preferably inbred lines. Conversely, pollen from plants of those important lines is used to pollinate 
regenerated plants. A transgenic plant of the present invention containing a desired polypeptide is 
cultivated using methods well known to one skilled in the art. 

A transgenic plant of this invention thus has an increased amount of a coding region encoding a 
B. thwingiensis 8-endotoxin polypeptide and a plastid targeting peptide. A preferred transgenic plant is 
an independent segregant and can transmit tiiat gene and its activity to its progeny. A more preferred 
transgenic plant is homozygous for that gene, and transmits that gene to all of its offspring on sexual 
mating. Seed from a transgenic plant may be grown in the field or greenhouse, and resuhing sexually 
mature transgenic plants are self-pollinated to generate tnie breeding plants. The progeny from these 
plants become true breeding lines that are evaluated for increased expression of the B. thwingiensis 
transgene. 

4.10 Identification of Transgenic Plant Events with Insect Tolerance 

To identify a transgenic plant expressing high levels of the 8-endotoxin of interest, it is necessary 
to screen the herbicide or antibiotic resistant transgenic, regenerated plants (Ro generation) for insecticidal 
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activhy and/or expression of the gene of interest. This can be accomplished by various methods well 
known to those skilled in the art. including but not limited to: 1) obtaining small tissue samples from the 
transgenic Ro plant and directly assaying the tissue for activity against susceptible insects in parallel with 
tissue derived from a non-expressing, negative control plant. For example, Ro transgenic com plants 
expressing B. thuringiensis endotoxins such as Ciy2Ab can be identified by assaying leaf tissue derived 
from such plants for activity against ECB; 2) analysis of protein extracts by enzyme linked immunoassays 
(ELISAs) specific for the gene of interest (Cry2Ab); or 3) reverse transcriptase PGR™ (RT PGR™) to 
identify events expressing the gene of interest 
4.11 Isolating Homologous Gene and Gene Fragments 

The genes and 8-endotoxins according to the subject invention include not only the full length 
sequences disclosed herein but also fragments of these sequences, or fusion proteins, which retain the 
characteristic insecticidal activity of the sequences specifically exemplified herein. 

It should be apparent to a person of skill in this art that insecticidal 5-endotoxins can be identified 
and obtained through several means. The specific genes, or portions thereof, may be obtained from a 
culture depository, or constructed synthetically, for example, by use of a gene machine. Variations of 
these genes may be readily constructed using standard techniques for making point mutations. Also, 
fragments of these genes can be made using commeroially available exonucleases or endonucleases 
according to standard procedures. For example, enzymes such as Bal3\ or site-directed mutagenesis can 
be used to systematically cut off nucleotides from the ends of these genes. Also, genes which code for 
active fragments may be obtained using a variety of other restriction enzymes. Proteases may be used to 
directly obtain active fragments of these S-endotoxins. 

Equivalent 6-endotoxins and/or genes encoding these 5-endotoxins can also be isolated from 
Bacillus strains and/or DNA libraries using the teachings provided herein. For example, antibodies to the 
5-endotoxins disclosed and claimed herein can be used to identify and isolate other 5-endotoxins from a 
mixture of proteins. Specifically, antibodies may be raised to the portions of the 5-endotoxins which are 
most constant and most distinct from other B. thuringiensis 5-endotoxins. These antibodies can then be 
used to specifically identify equivalent 5-endotoxins with the characteristic insecticidal activity by 
immunoprecipitation, enzyme linked immunoassay (ELISA), or Western blotting. 

A fiirther method for identifying the 5-endotoxins and genes of the subject invention is through 
the use of oligonucleotide probes. These probes are nucleotide sequences having a detectable label. As is 
well known in the art, if the probe molecule and nucleic acid sample hybridize by forming a strong bond 
between the two molecules, it can be reasonably assumed that the probe and sample are essentially 
identical. The probe's detectable label provides a means for determining in a known manner whether 
hybridization has occurred. Such a probe analysis provides a rapid method for identifying insecticidal 
5-endotoxin genes of the subject invention. 
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The nucleotide segments which are used as probes according to the invention can be synthesized 
by use of DNA synthesizers using standard procedures. In the use of the nucleotide segments as probes 
the particular probe is labeled with any suitable label known to those skilled in the art, including 
radioactive and non-radioactive labels. Typical radioactive labels include «P, '^'l. ^^s. or the like A 
probe labeled with a radioactive isotope can be constructed from a nucleotide s«,uence complementaiy to 
the DNA sample by a conventional nick translation reaction, using a DNase and DNA polymerase The 
probe and sample can then be combined in a hybridization buffer solution and held at an appropriate 
temperature until annealing occurs. Thereafter, the membrane is washed free of extraneous materials 
leavmg the sample and bound probe molecules typically detected and quantified by autoradiography 
and/or liquid scintillation counting. 

Non-radioactive labels include, for example, ligands such as biotin or thyroxin, as well as 
enzymes such as hydrolyses or peroxidases, or the various chemiluminescers such as Juciferin or 
fluorescent compounds like fluorescein and its derivatives. The probe may also be labeled at both ends 
wth d.fferent types of labels for ease of separation, as, for example, by using an isotopic label at the end 
mentioned above and a biotin label at the other end. 

Duplex formation and stability depend on substantial complementao' between the two strands of 
a hybrid, and, as noted above, a certain degree of mismatch can be tolerated. Therefore, the probes of the 
subject invention include mutations (both single and multiple), deletions, insertions of the described 
sequences, and combinations thereof, wherein said mutations, insertions and deletions pennit formation 
of stable hybrids with the target polynucleotide of interest. Mutations, insertions, and deletions can be 
produced in a given polynucleotide sequence in many ways, by methods currently known to an ordinarily 
skilled artisan, and perhaps by other metfiods which may become known in the future. 

The potential variations in the probes listed is due. in part, to the redundancy of the genetic code 
Because of the redundancy of the genetic code, more than one coding nucleotide triplet (codon) can be 
used for most of the amino acids used to make proteins. Therefore different nucleotide sequences can 
code for a particular amino acid. Thus, the amino acid sequences of the B. thuringiemis 8-e„dotoxins and 
peptides, and the plastid targeting peptides and the polynucleotides which code for them, can be prepared 
by equivalent nucleotide sequences encoding the same amino acid sequence of the protein or peptide 
Accordingly, the subject invention includes such equivalent nucleotide sequences. Also, inverse or 
complement sequences are an aspect of the subject invention and can be readily used by a person skilled 
.n this art. In addition it has been shown that proteins of identified structure and function may be 
constructed by changing the amino acid sequence if such changes do not alter the protein secondary 
structure (Kaiser and Kezdy. 1984). Thus, the subject invention includes mutants of the amino acid 
sequence depicted herein which do not alter the protein secondary structure, or if the structure is altered 
the biological activity is substantially retained. Further, the invention also includes mutants of organisms 
hosting all or part of a gene encoding a 8-endotoxin and gene encoding a plastid targeting peptide as 
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discussed in the present invention. Such mutants can be made by techniques well known to persons 
skilled in the art. For example, UV irradiation can be used to prepare mutants of host organisms. 
Likewise, such mutants may include asporogenous host cells which also can be prepared by procedures 
well known in the art. 
4.12 Site-Specific Mutagenesis 

Site-specific mutagenesis is a technique useful in the preparation of individual peptides, or 
biologically functional equivalent proteins or peptides, through specific mutagenesis of the underlying 
DNA. The technique further provides a ready ability to prepare and test sequence variants, for example, 
incorporating one or more of the foregoing considerations, by introducing one or more nucleotide 
sequence changes into the DNA. Site-specific mutagenesis allows the production of mutants through the 
use of specific oligonucleotide sequences which encode the DNA sequence of the desired mutation, as 
well as a sufficient number of adjacent nucleotides, to provide a primer sequence of sufficient size and 
sequence complexity to form a stable duplex on both sides of the deletion junction being traversed. 
Typically, a primer of about 17 to 25 nucleotides in lengtii is preferred, with about 5 to 10 residues on 
both sides of the junction of the sequence being altered. 

In general, the technique of site-specific mutagenesis is well known in the art, as exemplified by 
various publications. As will be appreciated, the technique typically employs a phage vector which exists 
in both a single stranded and double stranded form. Typical vectors useful in site-directed mutagenesis 
include vectors such as the M13 phage. These phage are readily commercially available and their use is 
generally well known to those skilled in the art. Double stranded plasmids are also routinely employed in 
site directed mutagenesis which eliminates the step of transferring the gene of interest from a plasmid to a 
phage. 

In general, site-directed mutagenesis in accordance herewith is performed by first obtaining a 
single-stranded vector or melting apart of two strands of a double stranded vector which includes within 
its sequence a DNA sequence which encodes the desired peptide. An oligonucleotide primer bearing the 
desired mutated sequence is prepared, generally synthetically. This primer is then annealed with the 
single-stranded vector, and subjected to DNA polymerizing enzymes such as E. coli polymerase 1 Klenow 
fragment, in order to complete the synthesis of the mutation-bearing strand. Thus, a heteroduplex is 
formed wherein one strand encodes the original non-mutated sequence and the second strand bears the 
desired mutation. This heteroduplex vector is then used to transform appropriate cells, such as £. coli 
cells, and clones are selected which include recombinant vectors bearing the mutated sequence 
arrangement. 

The preparation of sequence variants of the selected peptide-encoding DNA segments using site- 
directed mutagenesis is provided as a means of producing potentially useful species and is not meant to be 
limiting as there are other ways in which sequence variants of peptides and the DNA sequences encoding 
them may be obtained. For example, recombinant vectors encoding the desired peptide sequence may be 
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treated with mutagenic agents, such as hydroxylamine, to obtain sequence variants. Such procedures may 
favorably change the protein's biochemical and biophysical characteristics or its mode of action. These 
include, but are not limited to: 1) improved 6-endotoxin formation, 2) improved protein stability or 
reduced protease degradation, 3) improved insect membrane receptor recognition and binding, 4) 
improved oligomerization or channel formation in the insect midgut endothelium, and 5) improved 
insecticidal activity or insecticidal specificity due to any or all of the reasons stated above. 
4.13 Biological Functional Equivalents 

Modification and changes may be made in the structure of the peptides of the present invention 
and DNA segments which encode them and still obtain a functional molecule that encodes a protein or 
peptide with desirable characteristics. The biologically functional equivalent peptides, polypeptides, and 
proteins contemplated herein should possess about 80% or greater sequence similarity, preferably about 
85% or greater sequence similarity, and most preferably about 90% or greater sequence similarity, to the 
sequence of, or corresponding moiety within, the fundamental ciy2Ab amino acid sequence. 

The following is a discussion based upon changing the amino acids of a protein to create an 
equivalent, or even an improved, second-generation molecule. In particular embodiments of the 
invention, mutated crystal proteins are contemplated to be useful for increasing the insecticidal activity of 
the protein, and consequently increasing the insecticidal activity and/or expression of the recombmant 
transgene in a plant cell. The amino acid changes may be achieved by changing the codons of the DNA 
sequence, according to the codons given in Table 3. 

Table 3 



Ammo Acid Codons 


Aianme 


Ala 


A 


OCA 


GCC 


GCG 


GCU 


Cysteine 


Cys 


C 


UGC 


UGU 






Aspartic acid 


Asp 


D 


GAC 


GAU 






Glutamic acid 


Glu 


E 


GAA 


GAG 






Phenylalanine 


Phe 


F 


UUC 


UUU 






Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 


Histidine 


His 


H 


CAC 


CAU 






Isoleucine 


He 


I 


AUA 


AUC 


AUU 




Lysine 


Lys 


K 


AAA 


AAG 






Leucine 


Leu 


L 


UUA 


UUG 


CUA 


cue CUG CUU 


Methionine 


Met 


M 


AUG 








Asparagine 


Asn 


N 


AAC 


AAU 







wo 00/26371 



PCT/US99/26086 



-45 



Acid ■ Codons 


rroiine 


Pro 


P 


CCA 


ccr 




ecu 


Glutamine 


Gin 


o 


PA A 


LAO 






Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


Serine 


Ser 


S 


AC5C 


AGU 


UCA 


UCC 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 


Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 


Tryptophan 


Trp 


W 


UGG 








Tyrosine 


Tyr 


Y 


UAC 


UAU 







For example, certain amino acids may be substituted for other amino acids in a protein structure 
w,thout appreciable loss of interactive binding capacity with structures such as, for example antigen- 
bmdmg regions of antibodies or binding sites on substrate molecules. Since it is the interactive capacity 
and nature of a protein that defines that protein's biological functional activity, certain amino acid 
sequence substitutions can be made in a protein sequence, and, of course, its underlying DNA coding 
sequence, and nevertheless obtain a protein with like properties. It is thus contemplated by the inventors 
that various changes may be made in the peptide sequences of the disclosed compositions or 
corresponding DNA sequences which encode said peptides without appreciable loss of their biological 
Utility or activity. 

In making such changes, the hydropathic index of amino acids may be considered The 
importance of the hydropathic amino acid index in confemng interactive biologic function on a protein is 
generally understood in the art (Kyte and Doolittle, 1982. incorpoi^te herein by reference). It is accepted 
that the relative hydropathic character of the amino acid contributes to the secondary structure of the 
resultant protein, which in turn defines the interaction of the protein with other molecules, for example 
enzymes, substrates, receptors, DNA, antibodies, antigens, and the like. 

Each amino acid has been assigned a hydropathic index on the basis of their hydrophobicity and 
charge characteristics (Kyte and Doolittle, 1982). these are: isoleucine (+4.5); valine (+4.2); leucine 
(+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine (+1 .9); alanine (+1.8); glycine (-0 4)- 
threonine (-0.7); senne (-0.8); t^^ptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3 2)' 
glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (^.5) ' 
It IS known in the art that certain amino acids may be substituted by other amino acids having a 
similar hydropathic index or score and still result in a protein with similar biological activity ie still 
obtam a biological functionally equivalent protein. In making such changes, the substitution of amino 
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acids whose hydropathic indices are within ±2 is preferred, those which are within ±1 are particularly 

preferred, and those within ±0.5 are even more particularly preferred. 

It is also understood in the art that the substitution of like amino acids can be made effectively on 

the basis of hydrophilicity. U. S. Patent 4,554.101, incoiporated herein by reference, states that the 

greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of its adjacent amino 

acids, correlates with a biological property of the protein. 

As detailed in U. S. Patent 4,554, 101, the following hydrophilicity values have been assigned to 
amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0 ± 1); glutamate (+3.0 ± 1); serine 
(+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine (-0.4); proline (-0.5 ± 1); alanine (- 
0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); 
tyrosine (-2.3); phenylalanine (-2.5); tryptophan (-3.4). 

It is understood that an amino acid can be substituted for another having a similar hydrophilicity 
value and still obtain a biologically equivalent, and in particular, an immunologically equivalent protein. 
In such changes, the substitution of amino acids whose hydrophilicity values are within ±2 is prefen^, 
those which are within +1 are particularly preferred, and those within +0.5 are even more particularly 
preferred. 

As outlined above, amino acid substitutions are generally therefore based on the relative 
similarity of the amino acid side-chain substituents, for example, their hydrophobicity, hydrophilicity, 
charge, size, and the like. Exemplary substitutions which take various of the foregoing characteristics 
into consideration are well known to those of skill in the art and include: arginine and lysine; glutamate 
and aspartate; serine and threonine; glutamine and asparagine; and valine, leucine and isoleucine. 

Polynucleotides encoding 5-endotoxins derived from B. thuringiemis are known by those skilled 
in the art, to be pooriy expressed when incorporated into the nuclear DNA of transgenic plants (reviewed 
by Diehn et ai, 1996). Preferably, a nucleotide sequence encoding the 8-endotoxin of interest is designed 
essentially as described in U. S. Patent 5,500,365 and 5,689,052 (each specifically incorpomted herein by 
reference). Examples of nucleotide sequences useful for expression include but are not limited to crylAh 
(SEQIDNO:!). 

Peptides, polypeptides, and proteins biologically functionally equivalent to Cry2Ab include 
amino acid sequences containing conservative amino acid changes in the fundamental sequence shown in 
SEQ ID N0:2. In such amino acid sequences, one or more amino acids in the fundamental sequence is 
(are) substituted with another amino acid(s), the charge and polarity of which is similar to that of the 
native amino acid, i.e. a conservative amino acid substitution, resulting in a silent change. 

Substitutes for an amino acid within the fundamental polypeptide sequence can be selected from 
other members of the class to which the naturally occurring amino acid belongs. Amino acids can be 
divided into the following four groups: (1) acidic amino acids; (2) basic amino acids; (3) neutral polar 
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amino acids; and (4) neutral non-polar amino acids. Repr^senmtive amino acids within tiiese various 
groups mclude, but are not limited to: (1) acidic (negatively charged) amino acids such as aspartic acid 
and glutamic acid; (2) basic (positively charged) amino acids such as arginine, histidine and lysine- (3) 
neutral polar amino acids such as glycine, serine, threonine, cysteine, cystine, tyrosine, asparagine and 
glutamme; (4) neutral nonpolar (hydrophobic) amino acids such as alanine, leucine, isoleucine, valine, 
prohne, phenylalanine, tryptophan, and methionine. 

Conservative amino acid changes within the fundamental polypeptide sequence can be made by 
subsftutrng one amino acid within one of these g„,ups with another amino acid within the same group 
B.olog,caily functional equivalents of cr,2Ab can have 10 or fewer conservative amino acid changes 
more preferably seven or fewer conservative amino acid changes, and most preferably five or fewe^ 
conservative amino acid changes. IHe encoding nucleotide sequence (gene, plasmid DNA cDNA or 
synthetic DNA) will thus have coaesponding base substitutions, permitting it to encode biologicllly 
functional equivalent forms of crv2Ab. 

5.0 Examples 

The following examples are included to demonstrate preferred embodiments of the invention It 
should be appreciated by those of skill in the art that the techniques disclosed in the examples which 
follow represent techniques discovered by the inventor to function well in the practice of the invention 
and thus can be considered to constitute preferred modes for its practice. However, those of skill in the 
art should, in light of the present disclosure, appreciate that many changes can be made in the specific 
embodiments which are disclosed and still obtain a like or similar result without departing from the spirit 
and scope of the invention. 

5.1 Example 1 - Increased Expression of Cry2Ab by Targeted Vectors 

Expression of the Cor2Ab protein in com plants transformed with targeted and non-targeted 
C.y2Ab expression vectors was compared and was significantly higher in plants with the targeted vector 
Untargeted C,y2Ab plant expression vectors pMON26800 and pMON30463 contain an expression 
cassette composed of an enhanced CaMV35S promoter, a maize hsp70 intron, a synthetic crv2Ab gene 
wrth tianslatmnal initiation and termination codons (SEQ ID N0:1). and a nopaline synthase 
poiyadenylation site. 

The targeted plant expression vector pMON30464 (SEQ ID N0:16) contains an expression 
cassette including enhanced CaMV35S promote, a maize hsp70 intron, a maize ssRUBISCO chloroplast 
transit peptide (SEQ ID N0:3) fused in frame to a synthetic co.2Ab gene, and a nopaline synthase 
poiyadenylation site. 

All vectors (pMON26800. pMON30463. and pMON30464) also contain a cassette conferring 
paromomycin resistance to transformed plan, tissue. In the case of pMON26800. this cassette consists of 
an enhanced CaMV35S promoter , a maize hsp70 int^n. a neomycin phosphotransferase gene with a 
translauonal initiation and termination codons. and a nopaline synthase poiyadenylation site. In the case 
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of PMON30463 and pMON30464, this cassette consists of a CaMV35S promoter, a neomycin 
phosphotransferase gene with a translationai initiation and termination codons, and a nopaline synthase 
polyadenylation site. Transgenic com plants resistant to paromomycin were derived essentially as 
described in U. S. Patent 5,424,412 (specifically incorporated herein by reference). 

Leaf tissue from independently transformed transgenic events in the Ro stage was subjected to 
quantitative analysis of Cry2Ab protein levels by a quantitative ELISA assay. This ELISA used a direct 
sandwich technique that used a monoclonal capture antibody raised against Ciy2Aa, a different Cry2Aa 
monoclonal antibody conjugated to alkaline phosphatase as the secondary antibody, and purified Cry2Aa 
protein as a standard. 

Comparison of Cry2Ab expression levels in pMON30463 (non-targeted) and pMON30464 
(targeted) com plants show that non-targeted Co^2Ab expression does not exceed 15 ppm while targeted 
expression is frequently higher than 1 00 ppm (Table 4). Protein blot analyses confirm that the increased 
level of cross reactive material produced by pMON30464 (targeted) were due to increased accumulation 
of an approximately Mr 71,000 protein that co-migrates with Cry2Ab produced by pMON30463 (non- 
targeted) and Cry2Aa standard from B. thuringiensis. This data indicates that the targeting peptide fused 
to the N-terminus of Cry2Ab protein was efficiently processed or removed. 

Increased expression of Cry2Ab in pMON30464 (targeted) vectors relative to pMON26800 (non- 
targeted) vectors was also observed in R, progeny plants derived from the original Ro transgenic events, 
indicating that high expression is heritable (Table 5). 

Table 4 

*^'!Kmn4«J?'^'' '".^ S""" Transformed with Tai^eted (pMON30464) and Untai^eted 
(PMON30463) Expression Vectors: Distribution of Expression Levels in Different EvSts 

^'Stei Total 0 0:5 Sl5 15-50 SO- 100^ ^OQ- 



Events ECB+ ppm ppm ppm ppm 100 200 

ppm ppm 



ppm 



non- 16 3 0 0 3 6 0 0 O" 

targeted (190/^) 



(30463) 

targeted 40 14 

(30464) (35%) 
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TableS 



^TnMOm-Sfnm'?'^'' ^••''"''f"'-™^'^ Tai^eted (pMON30464) and Untargeted 

(PMON2 6800) Expression Vectors: Distribution of Expression Levels in Different Events 

Vector 



non- 
targeted 
(26800) 
targeted 
(30464) 



Total # 0 
events ppm 
assayed 



28 



33 



0 



0-5 
ppm 

~~T8~ 



5-15 
ppm 

10" 



15-50 
ppm 

Q— 



50-100 100-200 >200 
ppm ppm ppm 



0 



0 



17 



To effectively control insects that feed on a variety of corn tissues, it is critical that the 
•nsecticidal protein be expressed at high levels throughout all potential feeding sites. To determine if the 
increases in targeted expression of Cry2Ab occur in other tissues, independent targeted and non-targeted 
transgenic events representing the high expressing lines obtained with the respective vector types were 
assayed for Cry2Ab expression levels in parallel. Expression of Cry2Ab is increased in virtually all of the 
com tissues attacked by pests such as Ostrina mbialis and Helicoverpa zea by targeted expression (Table 
6). Uniform high level expression of this type is especially valuable in that it is less likely to permit 
evolved resistance of target pests via behavioral (feeding) adaptation. 

Table 6 

Targeted an d Untargeted Cry2Ab Expression in Transgenic Maize 

Vector Event N Root Leaf sheath stalk shank husk ^iii; 

NSOpMON #1 i il] 117.6 140.8 514.9 397.5 121.8 130.5 1652 

30464 

#2 4 11.3 + 4.5 105+ 12 121 +25 96+ 18 134 + 38 52 + 9.1 101 + 11 113 + 45 170 + 36 

N30pMON #1 2 1.2 + 0.4 10 + 5.3 20+12 28 + 5.6 2917.5 7.6 + 7.6 46 + 9.9 96 + 96 109 + 46 
26800 

Expression in ng Ciy2Ab / gm fresh weight (root and leaf) or dry weight tissue 
(sheath, stalk, shank, husk, silk, cob, kernel) shown + standard deviation (L30464 #2) 
or range (L26800#l). 

Further analyses indicate that the increased levels of Cry2Ab protein produced by pMON30464 
result in a commensurate increase in the level of bioactivity as measured directly in feeding assays. To 
assess the level of insecticidal activity produced, corn leaf tissue from control (non-transgenic), targeted 
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(pinon30464), and non-targeted (pMON30464) plants was assayed for activity against Heliothis virescem 
in tissue diet overlay studies (Table 7). Two concentrations of tissue (0.0016 and 0.0031 %) were 
bioassayed and the same sample of tissue used in the diet overlay was also subjected to quantitative 
ELISA determinations of Cry2Ab levels. The 7.5-fold increase in Cry2Ab levels in targeted 
(PMON30464) samples relative to the non-targeted (pMON30463) samples clearly correlates with the 
corresponding 6-fold difference in mean larval weight observed at both concentration rates. TTiese data 
thus indicate that the increased levels of Cry2Ab produced by pMON30464 result in commensurate 
increases in the level of bioactivity. 



Table? 

Correlation oflncreased Cry2Ab Expression Levels with Increased Bioactivity in Heliothis 

virescens Tissue Diet Overlay Bioassay 



Tissue Sample 


CryZAb Cone, 
(ppm) 


Tissue Cone. 1 
(0.0031% Tissue) 


Tissue Cone. 2 
(0.0016% Tissue) 


Mean Larval Wt. (mg) 


Mean Larval Wt. 
(mg) 


Control 


0.0 


22.00 


24.6 


Targeted 


444 


1.2 


2.1 


Untargeted 


60 


7.3 


12.7 



5.2 Example 2 - Plastid Targeting of Cry2Ab Increases Frequency of Agronomically-Normal 
Plants Recovered from Transformation 

To obtain a commercially viable transgene-based insect control trait, it is crucial that an event 
with noimal plant growth characteristics be obtained. In most instances a fairly large number of 
independent transgenic events are advanced into field tests to insure that an event that meets all of the key 
criteria (effective insect control, normal Mendelian behavior of the transgene, and normal growth 
characteristics or agronomics) will be identified. Methods that increase the frequency with which nonnal 
events are obtained are clearly valuable as they increase the odds of identifying an event tiiat can be 
commereializcd. It is also useful to enlarge the pool size of prospective events for screening by 
increasing the percentage of Ro events (primary regenerated plants) with fertility. As plant transformation 
is labor intensive, any metiiod tiiat decreases tfie number of Ro events that must be produced in order to 
obtain a transgenic event witii appropriate perfoimance and growth characteristics is also valuable. 

Large populations of independent t^sgene Ro insertion events of the non-targeted pMON26800 
and PMON30463 vectors, and the targeted pMON30464 vector, were generated and scored for fertility. It 
was observed tf,at a higher percentage of the Ro events generated with the targeted vector were fertile 
(Table 8). Progeny of fertile Ro events were subsequently introduced into field tests where they were 
scored for European com borer resistance (ECBl) and normal segregation. 
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Methods for determination of ECB 1 ratings and segregation values were essentially as described 
(Amstrong et al, 1995). Events that passed the ECBl and segregation criteria were subsequently scored 
for stunting or height reductions. While 60% of the non-taigeted events displayed height reductions, only 
3% of the targeted events were stunted (Table 8). Improved fertility and reduced stunting resulted in 
significantly improved {21% vs. 8%) recovery of unstunted ECBl positive events with the tai^eted 
Cry2Ab vector. In summaiy. 4-fold more non-targeted Ro events must be produced and screened to 
obtain the same number of normal, ECB+ Ro events obtained with the targeted Cor2Ab vector in a 
transformation study. 

Table 8 

Comparisons of Percentage of Fertile, Stunted, and Normal Maize Plants Obtained with 



Vector 


# ECB LD + Ro 
Events* 


% Fertile 
Events^ 


% Stunted*^ 


% Normal, ECBl 


Untargeted 


192 


66 


63 


7 


Targeted 


78 


85 


4 


31 



53 



^"^ECB LD + Ro events are the # of Ro events that were positive by an ECB leaf disk feeding assay. 
Vo of the ECB LD+ Ro events yielding viable Rl progeny (seed). 

Stunted is the % of the ECBl positive and properly segregating events with reduced stature. 
(Total ECBl positive and properly segregating for non-targeted was 38; for targeted was 25). 
%% normal, ECBl + is the % normal. ECB+ events obtained relative to the total number of ECB 
LD+ Ro events screened. 

Example 3 - Plastid Targeting of Cry2Ab Increases Frequency of High Level European 
Com Borer Control in Transgenic Corn 

The previously described populations of independently transfomed events derived from both 
targeted (pMON30464) and non-targeted (pMON30463 or pMON26800) Ciy2Ab expression vectors 
were also screened for resistance to second generation European com borer infestations (ECB2). To 
facilitate these studies, the commercially efficacious transgenic com event MON810 (Yieldgard™) 
transfomied with the Qyl Ab gene was included as a positive control. Efficacy against ECB2 was tested 
in field tests essentially as described (Amistrong et al, 1995). In the 1996 field test, 18 independent non- 
targeted PMON26800 events were compared to MON810 (CrylAb). Of these 18 events, only one 
delivered ECB2 protection that was both statistically indistinguishable from MON810 and significamly 
less than the non-transgenic negative control (event UTl in Table 9). In the 1997 field test, 18 
independent targeted events (pMON30464) were tested in parallel with 3 non- targeted eveml (1 
PMON30463 event and the two pMON26800 events derived from the 1996 tests) and MON810 (Table 
10). Nine of the eighteen targeted pMON30464 events delivered ECB2 protection that was statistically 
indistinguishable from ECB2 protection conferred by the commercially efficacious CrylAb-expressing 
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MON810 (Yieldgard™) event and all had significantly less ECB2 damage than the non-transgenic 
ne^tive control (Table 10). 

These data sets indicate that the absolute number and frequency of commereiaily efficacious 
Ciy2Ab lines obtained firom the targeted pMON30464 vector is much greater than that obtained from the 
non-targeted pMON26800 vector. While 9 of 18 targeted C^^Ab events (50%) delivered ECB2 control 
that was both statistically indistinguishable from the MON810 CiylAb commercial standard and 
significantly less than the non-transgenic negative control, only 1 of 18 non-targeted Cry2Ab events (6%) 
displayed ECB2 control that was both statistically indistinguishable from the MON810 crylAb 
commercial standard and significantly less than the non-transgenic negative control. The superiority of 
the targeted Cry2Ab expression vector is especially evident if one considers that 9 commercially 
efficacious Cry2Ab events were obtained from a total of 78 ECB leaf disk positive Ro plants for an 1 1 .5% 
frequency of recovery while only 3 commercially efficacious Ciy2Ab events were obtained from a total 
of 1 92 ECB leaf disk feeding positive Ros for a 1 .6% recovery frequency (Rp ECB data from Table 6). 

Table 9 

Comparison "[f CK Protection in Untargeted (UT) Cry2Ab Transgenic Com Relative to 
MON810 CiTlAb Yieldgard™Transgenic Corn in Field Tests 

Stalk Tunneling (inches ) 

03" " 

0.7»-* 

1.9" 
2.0' 
2.5"' 
2.6" 
2.9" 
3.1" 
3.4" 
3.4" 
3.5" 
3.6" 
3.7" 
3.8" 
4.6" 
5.8" 
6.8« 
7.6' 
9.3' 
1ft i« 



Event 


Sample Size 


MON810(+ctrl.) 


20 


UTl 


10 


UT2 


10 


UT3 


10 


UT4 


10 


UTS 


8 


UT6 


10 


UT7 


10 


UTS 


10 


UT9 


10 


UTIO 


10 


UTll 


4 


Wild^pe 


10 


UT12 


10 


UT13 


10 


UT14 


10 


UT15 


10 


UT16 


10 


UTl 7 


10 


UTl 8 


10 
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a» b 

Values maiked with the same superscript (a) are statistically indistinguishable from MON810 in 
planned comparisons at />= 0.05. Values with superscripts (b) are statistically distinct. Events with 
stalk tunneling values significantly greater than the CrylAb commercial standard MON810 are 
shovra in boldfece. Genetic background of all events is identical (B73 x H99). 

c * 

• Values maiked with an asterisk are significantly lower than the wild-type non-transgenic negative 
control in planned comparisons with the neptive control (/»=0.05). Values marked with superscript 
(c) are significantly greater than the wild-type non-tiansgenic negative control in plamied 
comparisons with the negative control (/>=0.05). UT1-UTI8: Untargeted pMON26800 events #1- 
18. 

Table 10 

Comparison of ECB2 Protection in Targeted (T) and Untargeted (UT) Cry2Ab Tninsaenic 
Corn Relative to MON810 CrylAb Yieldgard ™ Transgenic Corn inPreld T^ts 



iLveiii 


Sample Size 


Stalk Tunneling (inches) 


Tl 
1 1 


9 


0.6" 


1 Z 


10 


0.6» 


iviv^iNoiu ctri.^ 


30 


0.9' 


i J 


1 A 

14 


1* 


T4 


12 


1.3" 


T5 


7 


1.4' 


UTl 


JO 


L6' 


T6 


13 


1.6' 


T7 


11 


1.6' 


T8 


10 


1.7" 


UT2 


10 


L8' 


T9 


10 


2.4' 


TIO 


12 


2.5'' 


Til 


7 


2.6'' 


T12 


9 


2.9" 


T13 


10 


3.2" 


T14 


11 


3.3" 


T15 


10 


3.5" 


T16 


10 


4.0" 


T17 


10 


4.3" 


UT3 


8 


£8" 


T18 


8 


5.4" 


wild type (-Ctrl.) 


20 


13.7' 
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Values marked with the superscript (a) are statistically indistinguishable from 
MON810 in planned comparisons at P= 0.05. Values with superscripts are statistically 
distinct. Events with stalk tunneling values significantly greater than the CrylAb 
commercial standard MON810 are shown in boldface; all transgenic events display 
significantly less tunneling than the wild type non-transgenic negative control in 
planned comparisons to the negative control (P=0.5). Genetic background of all 
events is identical (B73 x H99). 

T1-T18: Targeted PMON30464 events #1-18. UT1-UT3: Untargeted pMON30463 
and PMON26800 events #1-3. UTl in the 1997 field test is the same pMON26800 
event as UT3 in the 1996 field test. 
5.4 Example 4 - Plastid Targeting of the CrylAb Protein Results in Increased Expression in 
Transgenic Cotton CaUus Tissue 

Levels of the Cry2Ab protein in cotton callus tissue transformed with plastid-targeted and non- 
targeted Cry2Ab expression vectors were compared. Cry2Ab levels were significantly higher in callus 
that had been transformed with die plastid-targeted genes (Table 1 1). 

Plant expression vector pMON33830 contained a Cry2Ab expression cassette consisting of the 
following genetic elemems operably linked to produce functional C,y2Ab protein in plant cells: an 
enhanced CaMV 35S promoter, a petunia hsp70 5' untranslated leader, a synthetic cry2M> gene with a 
translation initiation codon (SEQ ID NO: I ), and transcription tennination and polyadenylation sequences 
from the nopaline synthase (NOS) gene of ^. tumefaciens. 

Plant expression vectors pMON33827 (SEQ ID N0:I3), pMON33828 (SEQ ID N014) and 
PMON33829 (SEQ ID N0:15) contained Cor2Ab expression cassettes similar to that occumng in 
PMON33830 except that in each a different chloroplast targeting sequence was tianslationally fused to the 
N-terminus of the synthetic crylAh gene. pMON33827 contained the coding sequence for PTPl (SEQ 
ID N0:5) which consists of an Arabidopsis thaliana ssRUBISCO (SSU) chloroplast targeting sequence 
and sequences coding for the first 24 amino acids of ssRUBISCO (SSU) protein (Wong et al, 1992). 
SEQID N0:6 represents the PTP] targeting peptide sequence. This peptide contains the completL native 
targeting sequence including the plastid targeting peptide cleavage site along with the first twenty-four 
amino acids of the mature RUBISCO SSU protein sequentially linked to a duplicated sequence of amino 
acids (SEQID N0:6 amino acids position No. 50-57) containing the RUBISCO SSU plastid targeting 
peptide cleavage site (SEQID NO:6 amino acids position No. 80-87). PTPl therefor contains a 
duplicated plastid targeting peptide cleavage site. The polynucleotide cassette containing this PTP coding 
sequence is linked at its 3' end to an Ncol restriction site which allows for insertions of coding sequences 
which are translationally in-frame with the PTP coding sequence, for example, those which encode 
Cry2Ab, Cry2Aa, variants of these, and other useful polypeptide encoding sequences. 
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PMON33828 contained the coding sequence for PTPIA (SEQ ID N0:7), a modification of PTPl 
in which the 24 amino acids of SSU between the two transit peptide cleavage sites was removed by 
cleavage with the restriction enzyme SpM, which cuts once within each copy of the transit peptide 
cleavage site, and re-ligation, resulting in the presence of only the transit peptide portion of PTPl 
followed by a single copy of the transit peptide cleavage site and an Ncol site. The peptide sequence for 
PTPIA is designated SEQ ID N0:8. 

PMON33829 contained the coding sequence for PTP2 (SEQ ID NO:9), the transit peptide 
sequence from the EPSP synthase gene of Arabidopsis thalicma. The peptide sequence for PTP2 is 
designated SEQ ID NO: 1 0. 

All of the above plant transformation expression vectors also contained a selectable marker gene 
cassette which confers kanamycin resistance to transformed plant cells. 

Cotton callus tissue from 12 randomly chosen, independent transgenic events from 
transformations with each of pMON33827. pMON33828, pMON33829 and pMON33830 was subjected 
to quantitative analysis of Cry2Ab protein levels using a quantitative ELISA assay. This ELISA used a 
direct sandwich technique that used a monoclonal capture antibody raised against Cry2Aa, a different 
Cry2Aa monoclonal antibody conjugated to alkaline phosphatase as the secondary antibody, and purified 
Cry2Aa protein as a standard. Comparison of Ciy2Ab expression levels in targeted and non-targeted 
callus tissue showed a significant increase in expression when a chloroplast targeting sequence was 
included (Table 1 1). PTPIA provided a significantly greater mean expression level when compared to 
non-targeted Cry2Ab as determined by applying a t test (t = 2.31, p = 0.03). PTP2 provided a 
significantly greater probability of obtaining callus lines expressing higher levels of Cry2Ab as 
determined by applying a G test (GVx^ = 5.6,/? = 0.02). 



Table 11 



Ci72Ab Levels in Independent Transformed Cotton Callus Lines Comparing Chloroplast-Targeted 

and Untargeted cr^lAb genes 



Cotton Callus Lines 


Cry2Ab ng/mL of Callus Extract 


Non-transformed callus 




Line 1 


0 


Line 2 


0 


Line 3 


0 


Line 4 


0 


PMON33827, PTPl-cry2Ab gene 




Line 1 


464 


Line 2 


61 
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Cotton Callus Lines 



Line 3 
Line 4 
Line S 
Line 6 
Line? 
Line 8 
Line 9 
Line 10 
Line 1 1 
Line 12 



pMON33828, PTPl A-c/3/2Ab Gene 



Cry2Ab ng/mL of Callus Extract 

0 ' 

25 

0 

368 

74 

101 

20 

652 

0 

0 



Line 1 


252 


Line 2 


235 


Line 3 


0 


Line 4 


416 


Line 5 


0 


Line 6 


0 


Line? 


0 


Line 8 


101 


Line 9 


393 


Line 10 


587 


Line 1 1 


788 


Line 12 


277 



pMON33829, PTP2-crK2Ab Gene 




Line I 


60 


Line 2 


0 


Line 3 


2220 


Line 4 


2036 


Line 5 


0 


Line 6 


38 


Line 7 


674 


Line 8 


2440 


Line 9 


15 


Line 10 


91 
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Cotton Callus Lines 

Line 1 1 



CryZAb ng/mL of Callus Extract 



Line 12 
pMON33830, crylAb Gene 



290 
71 



Line 1 


19 


Line 2 


166 


Line 3 


47 


Line 4 


20 


Line 5 


33 


Line 6 


47 


Line 7 


781 


Line 8 


35 


Line 9 


31 


Line 10 


0 


Line 1 1 


0 


Line 12 


136 



5.5 Example 5 - Targeting the Cry2Aa Protein to Plastids Results in Decreased Expression in 
Transgenic Cotton Callus Tissue 

In contrast to Example 4 above, and exemplifying that the increase in expression obtained usbg 
plastid targeting sequences is specific to particular cry genes, the inventors discovered that the same 
plastid targeting sequences described above, PTPl, PTPIA and PTP2, resulted in significantly lower 
levels of expression of the closely related cry2Aa gene in transgenic cotton callus (Table 12) Plant 
expression vector pMON33803 contained a cr,2Aa expression cassette consisting of the following 
genetic elements opeiably linked to produce functional Cry2Aa protein in plant cells: a FMV35S 
promoter, a petunia heat shock HSP70 5' untranslated leader, a synthetic crv2Aa gene (SEQ ID NO. l 1) 
with a translation initiation codon and Ncol restriction enzyme site at the 5'-end, and transcription 
termination and polyadenylation sequences from the E9 SSU gene from pea. The peptide sequence for 
the C,y2Aa protein is designated SEQ ID N0:12. pMON33812, pMON3381I, and pMON33806 
contained cry2Aa expression cassettes similar to that occurring in pMON33803 except that in each case a 
different chloroplast targeting sequence (PTPl, PTPIA. and PTP2, respectively) was transitionally fused 
to the N-terminus of the synthetic crv2Aa gene. All of these vectors also contained a selectable marker 
gene cassette conferring glyphosate resistance to transformed plant cells. 

Cotton callus tissue from 10 randomly chosen independent transgenic events from 
transformations with each of pMON33803, pMON33812, pMON3381 1 and pMON33806 was subjected 
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to quantitative analysis of Cry2Aa protein levels using the quantitative Cry2 ELISA assay. Comparison 
of Ciy2Aa expression levels in targeted and non-targeted callus tissue showed a significant decrease in 
expression when chloroplast targeting sequences were included (Table 12). The non-targeted cry2Aa 
gene conferred expression levels that differed significantly from those achieved using any of the three 
plastid targeted cry2Aa genes, as determined by using a Tukey-Kramer HSD test (a = 0.05). 

Table 12 

Cry2Aa Levels In Independent Transformed Cotton Callus Lines Comparing 
Chloroplast-Tai^eted And Untargeted Cry2Aa Genes 

Cotton Callus Lines 



Non-transformed callus 
Line 1 
Line 2 
Line 3 
Line 4 



pMON33812, PTPl-Cry2Aa Gene 



pMON33811. PTPIA -Cry2Aa Gene 



Cry2Aa ng/mL of Extract 



0 
0 
0 
0 



Line 1 


29 


Line 2 


32 


Line 3 


22 


Line 4 


41 


Line 5 


24 


Line 6 


47 


Line? 


43 


Line 8 


49 


Line 9 


0 


Line 10 


23 



Line 1 


0 


Line 2 


59 


Line 3 


48 


Line 4 


72 


Line 5 


29 


Line 6 


37 


Line? 


44 


Line 8 


32 


Line 9 


20 
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Cotton Callus Lines 


Cry2Aa ng/mL of Extract 


Line 10 


0 


PMON33806, PlP2-CTy2Aa Gene 




Line 1 


27 


Line 2 


0 


Lines 


10 


Line 4 


84 


Line S 


205 


Line 6 


0 


Line 7 


13 


Line 8 


6 


Line 9 


0 


Line 10 


8 


PMON33803, Cry2Aa Gene 




Line 1 


63 


Line 2 


2278 


Line 3 


181 


Line 4 


3131 


Line 5 


3752 


Line 6 


851 


Line 7 


303 


Line 8 


1365 


Line 9 


1601 


Line 10 


1648 



5.6 



10 



Example 6 - Targeting the Cry2Aa Protein to Plastids Results in Decreased Expression and 
Increased Phytotoxicity in Transgenic Tobacco Plants 

Transfoimed tobacco plants were generated using pMON33803, the non-targeted co'2Aa plant 
expression vector and pMON33806. the chloroplast-targeted PTP2-cry2Aa plant expression vector. Leaf 
tissue samples of equivalent weight from 48 pMON33803 plants and 41 pMON33806 plants were 
extracted in equal volumes of extraction buffer and the relative levels of cry2Aa were determined using a 
qualitative ELISA (Table 13). This ELISA used a direct sandwich technique that used polyclonal capture 
antibody raised against CrylM, the same polyclonal antibody conjugated to alkaline phosphatase as the 
secondary antibody, and purified Cry2Aa protein as a standard. 

The proportion of the total number of plants recovered from transformation that expressed non- 
targeted Cry2Aa at high levels was greater than the proportion of plams recovered that expressed targeted 
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Ciy2Aa at high levels. Conversely, the proportion of the total number of plants recovered from 
transformation that failed to express detectable targeted Cry2Aa was greater than the proportion of plants 
recovered that failed to express non-targeted Ciy2Aa. All of the PTP2-Cry2Aa plants that had detectable 
levels of Cry2Aa expression exhibited a severely abnormal phenotype; these plants were extremely 
stunted, had shortened intemodes, had deformed, wrinkled leaves, and were infertile. All of the PTP2- 
Cry2Aa plants that lacked Co'2Aa expression appeared normal. In contrast, only some of the high 
expressing non-targeted Cry2Aa plants displayed a stunted phenotype. 



Table 13 

Cry2Aa Levels in Independent Transformed Tobacco Plants Comparing 
Chloroplast-Targeted and Untargeted ciy2Aa Genes 



Transgenic Plants 


ELISA O.D. 


Transgenic Plants 


EUSA OJD. 


PMON33 803 cr^a Aa 




pMON33806 




gene 




PTP2-cr>'2Aa gene 




Plant 1 


2.5 


Plant 1 


0 


Plant 2 


I.l 


Plant 2 


0 


Plant 3 


2.7 


Plant 3 


0 


Plant 4 


0.1 


Plant 4 


0 


Plants 


0.1 


Plants 


0 


Plant 6 


2.3 


Plant 6 


0.9 


Plant? 


1.9 


Plant? 


0.4 


Plants 


2.4 


Plants 


0.4 


Plant 9 


0 


Plant 9 


0 


Plant 10 


2.1 


Plant 10 


0.6 


Plant 1 ] 


0.1 


Plant 1 1 


0 


Plant 12 


0.5 


Plant 12 


0.5 


Plant 13 


2.4 


Plant 13 


0.4 


Plant 14 


O.I 


Plant 14 


0.7 


Plant 15 


2.2 


Plant 15 


1.5 


Plant 16 


0.2 


Plant 16 


0.6 


Plant 17 


2.6 


Plant 1? 


0 


Plant 18 


2.5 


Plant IS 


0 


Plant 19 


2.5 


Plant 19 


0 


Plant 20 


1.4 


Plant 20 


0 


Plant 21 


2.4 


Plant 21 


0 


Plant 22 


2.1 


Plant 22 


0 
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Transgenic Plants ELISA OJ). 


Transgenic Plants 




"lani Zj 


U.D 


Plant 23 


u.o 


Plant 24 


2.1 


Plant 24 


u 


Plant 2S 


0.3 


Plant 25 


A 

u 


Plant 7f\ 


0 


Plant 26 


U.o 


Plant 97 


0.3 


Plant 27 


A 

U 


Plant OH 


2.2 


Plant 28 


U./ 


Plant 90 


0 


Plant 29 


0.5 


Plant '^n 


1.5 


Plant 30 


0 


Plant 1 


0.1 


Plant 31 


0 


Plant 79 


0.1 


Plant 32 


0 


Plant 'X'X 


0.7 


Plant 33 


A 
0 


Plant "^A 


0 


Plant 14 


0 


Plant 7^ 


0 


Plant IS 


0 


Plant 36 


0 


Plant m 


A 

0 


Plant 37 


0.2 


riant J 7 


0 


Plant 38 


2.1 


riant Jo 


0 


Plant 39 


0 


riant 39 


0 


Plant 4Q 


1.9 


riant 40 


0 


Plant 41 


1.5 


Din..-* A 1 

riant 41 


0 


Plant 42 


2.8 






Plant 43 


0.6 






Plant 44 


2.1 






Plant 45 


0.9 






Plant 46 


0 






Plant 47 


0 






Plant 48 


0 







5.7 Example 7 - Transformatioii of Tobacco Chloroplast with a CrylAb gene 

Recombinant plants can be produced in which only the mitochondrial or chloroplast DNA has 
been altered to incoiporate the molecules envisioned in this application. Promoters which function in 
chloroplasts have been known in the art (Hanley-Bowden et al.. Trends in Biochemical Sciences 12:67- 
70, 1987). Methods and compositions for obtaining cells containing chloroplasts into which heterologous 
DNA has been inserted have been described, for example by Daniell et al. (U.S. Pat. No. 5,693,507 
1997)andMaligaetal.(U.S.Pat.No. 5,451,513; 1995). A vector can be constructed which contains an 
expression cassette from which a Cry2A protein could be produced. A cassette could contain a 
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chloroplast operable promoter sequence driving expression of a crylP. costal protein gene, constructed in 
much the same manner as other polynucleotides herein, using thermal amplification methodologies 
restriction endonuclease digestion, and ligation etc. A chloroplast expressible gene would provide a 
promoter and a 5' untranslated region from a heterologous gene or chloroplast gene such as^A, which 
would provide for transcription and translation of a DNA sequence encoding a Cry2A protein in the 
chloroplast; a DNA sequence encoding Cry2A protein; and a transcriptional and translational termination 
region such as a 3' inverted repeat region of a chloroplast gene that could stabilize an expressed cnQ.K 
mRNA. Expression from within the chloroplast would enhance crylK gene product accumulation. A 
host cell containing chloroplasts or plastids can be transformed with the expression cassette and then the 
resulting cell containing the transformed chloroplasts can be grown to express the Co^A protein. A 
cassette may also include an antibiotic, herbicide tolerance, or other selectable marker gene in addition to 
the crylK gene. The expression cassette may be flanked by DNA sequences obtained from a chloroplast 
DNA which would facilitate stable integration of the expression cassette into the chloroplast genome 
particularly by homologous recombination. Alternatively, the expression cassette may not integrate, bui 
by mcluding an origin of replication obtained from a chloroplast DNA, would be capable of providing for 
replication of the heterologous crv2A gene in the chloroplast. Plants can be generated from cells 
containing transformed chloroplasts and can then be grown to produce seeds, from which additional 
plants can be generated. Such transformation methods are advantageous over nuclear genome 
transformation, in particular where chloroplast transformation is effected by integration into the 
chloroplast genome, because chloroplast genes in general are maternally inherited. This provides 
environmentally "safer" transgenic plants, virtually eliminating the possibility of escapes into the 
environment. Furthermore, chloroplasts can be transformed multiple times to produce functional 
chloroplast genomes which express multiple desired recombinant proteins, whereas nuclear genomic 
transformation has been shown to be rather limited when multiple genes are desired. Segregational events 
are thus avoided using chloroplast or plastid transformation. Unlike plant nuclear genome expression, 
expression in chloroplasts or plastids can be initiated from only one promoter and continue through a 
polycistronic region to produce multiple peptides from a single mRNA. 

The expression cassette would be produced in much the same way that other plant transformation 
vectors are constructed. Plant chloroplast operable DNA sequences can be inserted into a bacterial 
plasmid and linked to DNA sequences expressing desired gene products, such as C,y2A proteins, so that 
Cry2A protein is produced within the chloroplast. obviating the requirement for nuclear gene regulation, 
capping, splicing, or polyadenylation of nuclear regulated genes, or chloroplast or plastid targeting 
sequences. An expression cassette comprising a aylK gene, which is either synthetically constructed or 
a native gene derived directly from a B. thuringiensis genome or a B. thuringiensis episomal element 
would be inserted into a restriction site in a vector constructed for the purpose of chloroplast or plastid' 
transformation. The cassette would be flanked upstream by a chloroplast or plastid functional promoter 
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and downstream by a chbroplast or plastid functional transcription and translation termination sequence. 
The resulting cassette could be incoiporated into the chloroplast or plastid genome using well known 
homologous recombination methods. 

Alternatively, chloroplast or plastid transformation could be obtained by using an autonomously 
replicating plasmid or other vector capable of propagation within the chloroplast or plastid. One means of 
effectuating this method would be to utilize a portion of the chloroplast or plastid genome required for 
chloroplast or plastid replication initiation as a means for maintaining the plasmid or vector in the 
transformed chloroplast or plastid. A sequence enabling stable replication of a chloroplast or plastid 
epigenetic element could easily be identified from random cloning of a chloroplast or plastid genome into 
a standard bacterial vector which also contains a chloroplast or plastid selectable marker gene, followed 
by transformation of chloroplasts or plastids and selection for transformed cells on an appropriate 
selection medium. Introduction of an expression cassette as described herein into a chloroplast or plastid 
replicable epigenetic element would provide an effective means for localizing a Cry2A B. Ihurmgiensis 5- 
endotoxin to the chloroplast or plastid. 
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All of the compositions and methods disclosed and claimed herein can be made and executed 
without undue experimentation in light of the present disclosure. While the compositions and methods of 
this invention have been described in terms of preferred embodiments, it will be apparent to those of skill 
in the art that variations may be applied to the compositions and methods and in the steps or in the 
sequence of steps of the method described herein without departing from the concept, spirit and scope of 
the invention. More specifically, it will be apparent that certain agents which are both chemically and 
physiologically related may be substituted for the agents described herein while the same or similar 
results would be achieved. All such similar substitutes and modifications apparent to those skilled in the 
art are deemed to be within the spirit, scope and concept of the invention as defined by the appended 



10 claims 



wo 00/26371 

PCT/US99/26086 

-67- 

Claims: 

1 . A plant comprising a nucleic acid sequence comprising a plant functional promoter 
sequence operably linked to a polynucleotide sequence encoding a Cry2A Bacillus 
thuringiemis 8-endotoxin protein which lacks substantial Dipteran species inhibitory 
activity, wherein expression of said nucleic acid sequence in said plant yields said 
protein localized to a subcellular organelle or compartment 

2. A plant comprising a nucleic acid sequence comprising a plant functional promoter 
sequence operably linked to a first polynucleotide sequence encoding a plastid transit 
peptide, which is linked in-frame to a second polynucleotide sequence encoding a Ciy2A 
Bacillus thuringiensis 5-endotoxin protein which lacks substantial Dipteran species 
inhibitoiy activity, wherein said second polynucleotide is operably linked to a plant 
fiinctional 3' end transcription termination and polyadenylation sequence, wherein 
expression of said nucleic acid sequence in said plant yields a fusion protein comprised 
of an amino-terminal plastid transit peptide covalently linked to said 6-endotoxin protein, 
and wherein said fusion protein functions to localize said 6-endotoxin protein to a 
subcellular organelle or compartment. 

3. The plant of claim 1 , wherein said polynucleotide sequence encoding a Cry2A 
Bacillus thuringiensis 5-endotoxin protein comprises a sequence encoding a Cry2Ab 
Bacillus thuringiensis 8-endotoxin protein. 

4. The plant of claim 2, wherein said polynucleotide sequence encoding a Cry2A 
Bacillus thuringiensis 8-endotoxin protein comprises a sequence encoding a Ciy2Ab 
Bacillus thuringiensis S-endotoxin protein. 

5. The plant of claim 1 , wherein said subcellular organelle or compartment is a 
plant plastid or chloroplast. 

6. The plant of claim 2, wherein said subcellular organelle or compartment is a 
plant plastid or chloroplast. 

7. The plant of claim 1 , wherein said nucleic acid sequence is introduced into and stably 
maintained within a plant plastid or chloroplast. 

{. A plant tissue derived from progeny of the plam according to claim 5, wherein said plant 

tissue comprises a plant, plant seed, or plant cells containing said polynucleotide 

sequence encoding said 5-endotoxin protein 
». A plant tissue derived from progeny of the plant according to claim 6. wherein said plant 

tissue comprises a plant, plant seed, or plant cells containing said polynucleotide 

sequence encoding said 5-endotoxin protein. 

The plant according to claim 1, wherein said nucleic acid sequence comprising a 
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promoter is a plant chloroplast or plastid functional promoter. 

1 1 . The plant according to claim 2, wherein said nucleic acid sequence comprising a plant 
functional promoter is a promoter sequence which is naturally expressed in plants. 

12. The plant according to claim 1 , wherein said polynucleotide sequence 

encoding a Cry2A Bacillus thwingiensis 5-endotoxin protein is selected from the group 
consisting of SEQ ID NO: 1 , SEQ ID NO: 11, and SEQ ID NO: 1 7. 

13. The plant according to claim 2, wherein said polynucleotide sequence 

encoding a Cry2A Bacillus thwingiensis 5-endotoxin protein is selected from the group 
consisting of SEQ ID NO: 1 , SEQ ID NO: 1 1 , and SEQ ID NO: 1 7. 

14. The plant according to claim 1 , wherein said Cry2Ab Bacillus thuringiensis 5-endotoxin 
protein is selected from the group consisting of SEQ ID NO:2, SEQ ID NO: 12, and SEQ 
ID NO: 18. 

15. The plant according to claim 2, wherein said Cry2Ab Bacillus thuringiensis 5-endotoxin 
protein is selected from the group consisting of SEQ ID NO:2, SEQ ID NO: 12, and SEQ 
ID NO: 18. 

16. The plant of claim 2, wherein said nucleic acid sequence further comprises a plant 
functional intron sequence. 

17. The plant of claim 1 6, wherein said intron sequence is selected from the group consisting 
of Adh intron 1 , sucrose synthase intron, TMV omega element, maize Heat Shock Protein 
70 intron, and the rice Actl intron. 

18. The plant of claim 1 6, wherein said intron sequence is the maize Heat Shock Protein 70 
intron. 

19. The plant of claim 2, wherein first polynucleotide sequence encodes a plastid transit 
peptide selected from the group consisting of zmSSU FTP, PTPl, PTPl A , and PTP2. 

20. The plant of claim 1 9, wherein said zmSSU PTP plastid transit peptide comprising SEQ 
ID NO:4 is encoded by the nucleic acid sequence comprising SEQ ID N0:3. 

21 . The plant of claim 1 9, wherein said PTP 1 plastid transit peptide comprising SEQ ID 
N0:6 is encoded by the nucleic acid sequence comprising SEQ ID N0:5. 

22. The plant of claim 19, wherein said PTPIA plastid transit peptide comprising SEQ ID 
N0:8 is encoded by the nucleic acid sequence comprising SEQ ID NO:7, 

23. The plant of claim 1 9, wherein said PTP2 plastid transit peptide comprising SEQ ID 
NO: 10 is encoded by the nucleic acid sequence comprising SEQ ID NO:9. 

24. The plant of claim 2, comprising nucleotides 1 7 to 3 1 82 of SEQ ID NO: 1 3 

25. The plant of claim 2, comprising nucleotides 1 7 to 3092 of SEQ ID NO: 1 4. 

26. The plant of claim 2, comprising nucleotides 17 to 3 155 of SEQ ID NO: 15. 
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27. The plant of claim 1 , wherein the plant is a monocotyledonous plant 

28. The plant of claim 2, wherein the plant is a monocotyledonous plant. 

29. The plant of claim 27, wherein the plant is a monocotyledonous plant selected from the 
group consisting of maize, rice, wheat, barley, oats, rye, millet, sorghum, sugarcane, and 
turfgrass. 

30. The plant of claim 28, wherein the plant is a monocotyledonous plant selected from the 
group consisting of maize, rice, wheat, barley, oats, rye, millet, sorghum, sugarcane, and 
turfgrass. 

3 1 . The plant of claim 1 , wherein said plant is a dicotyledonous plant. 

32. The plant of claim 2, wherein said plant is a dicotyledonous plant. 

33. The plant of claim 3 1 , wherein the plant is a dicotyledonous plant selected from the 
group consisting of cotton, soybean, tomato, potato, citrus, tobacco, canola, and 
strawberry. 

34. The plant of claim 32, wherein the plant is a dicotyledonous plant selected from the 
group consisting of cotton, soybean, tomato, potato, citrus, tobacco, canola, and 
strawberry. 

35. The plant of claim 1 , further comprising an Ro transgenic plant. 

36. The plant of claim 2, further comprising an Ro transgenic plant. 

37. A progeny plant of any generation of the plant of claim 35, wherein said plant has 
inherited said nucleic acid sequence from said Ro transgenic plant. 

38. A progeny plant of any generation of the plant of claim 36, wherein said plant has 
inherited said nucleic acid sequence from said Ro transgenic plant. 

39. The plant according to claim 1, wherein said plant further comprises an additional nucleic acid 
sequence comprising a plant operable promoter linked to a polynucleotide sequence encoding a 
Cryl B. thvringiemis S-endotoxin protein 

40. The plant according to claim 2, wherein said plant further comprises an additional nucleic acid 
sequence comprising a plant operable promoter linked to a polynucleotide sequence encoding a 
Cryl A thuringiensis 6-endotoxin protein. 

41 . A method of producing a transgenic progeny plant comprising: 

(a) obtaining a first plant containing a nucleic acid sequence comprising a plant functional 
promoter operably linked to a first polynucleotide sequence encoding a plastid transit 
peptide, which is linked in frame to a second polynucleotide sequence encoding a Cry2A 
Bacillus thuringiensis 5-endotoxin protein lacking substantial Dipteran species inhibitory 
activity, wherein said second polynucleotide is operably linked to a plant functional 3' 
end transcription termination and polyadenylation sequence, wherein expression of said 
nucleic acid sequence in said plant yields a fusion protein comprising an amino-terminal 
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plastid transit peptide covalently linked to said 5-endotoxin protein, and wherein said 
fusion protein functions to localize said S-endotoxin protein to a subcellular organelle or 
compartment; ^ 

(b) obtaining a second plant; and 

(c) crossing said first and second plants to obtain a crossed transgenic progeny plant, said 
progeny plant having inherited said nucleic acid sequence from said first plant 

42. The method ofclaim 41, wherein said progeny plant is a monocotyledonous plant, said 
monocotyledonous plant being selected from the group consisting of maize, rice, wheat, 
barley, oats, rye, millet, sorghum, sugarcane, and turfgrass. 

43. The method of claim 41 , wherein said progeny plant is a dicotyledonous plant, said 
dicotyledonous plant being selected from the group consisting of cotton, soybean, 
tomato, potato, citrus, and tobacco. 

44. A nucleic acid sequence comprising a promoter operably linked to a first polynucleotide 
sequence encoding a plastid transit peptide, which is linked in frame to a second 
polynucleotide sequence encoding a Cry2A Bacilltis thta-ingiensis 5-endotoxin protein 
lacking substantial Dipteran inhibitory activity, wherein expression of said nucleic acid 
sequence by a plant cell produces a fusion protein comprising an amino-terminal plastid 
transit peptide covalently linked to said 5-endotoxin protein, and wherein said fusion 
protein functions to localize said 5-endotoxin protein to a subcellular organelle or 
compartment. 

45. The nucleic acid sequence of claim 44, wherein said second polynucleotide sequence 
encodes a Cry2Ab Bacillus thuringtensis 8-endotoxin protein. 

46. The nucleic acid sequence of claim 45, wherein said second polynucleotide sequence 
encodes a Cry2Ab Bacillus thuringiensis 5-endotoxin protein selected from the group of 
sequences consisting of SEQ ID NO:2 and SEQ ID NO: 18. 

47. The nucleic acid sequence of claim 46, wherein said second polynucleotide sequence is 
selected from the group of sequences consisting of SEQ ID NO: 1 and SEQ ID NO: 1 7. 

48. A plant cell comprising a nucleic acid sequence comprising a promoter operably linked 
to a polynucleotide sequence encoding a Cry2A Bacillus thuringiensis 5-endotoxin 
protein lacking substantial Dipteran inhibitory activity, wherein expression of said 
nucleic acid sequence in said plant yields said protein localized to a subcellular organelle 
or compartment. 

49. The plant cell of claim 48, wherein said polynucleotide sequence encoding a Cry2A 
Bacillus thuringiensis 5-endotoxin protein encodes a Cry2Ab Bacillus thuringiensis 6- 
endotoxin protein. 
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50. The plant cell of claim 49, wherein said subcellular organelle or compartment is a plant 
plastid or chloroplast. 

5 1 . The plant cell of claim 48, wherein said nucleic acid sequence is introduced into and 
stably maintained within a plant plastid or chloroplast. 

52. The plant cell of claim 51 wherein said nucleic acid sequence is expressed within said 
plastid or chloroplast, said expression producing an insecticidally effective amount of 
said S-endotoxin protein localized to said plastid or chloroplast. 

53. A plant tissue derived from progeny of the plant cell according to claim 48, wherein said 
plant tissue comprises a plant, plant seed, plant cells or progeny tissues thereof 
containing said polynucleotide sequence expressing said S-endotoxin localized to a plant 
plastid or chloroplast. 

54. The plant cell according to claim 50, wherein said nucleic acid sequence comprising a 
promoter is a plant chloroplast or plastid functional promoter. 

55. The plant cell according to claim 54, wherein said polynucleotide sequence encoding a 
Cry2A Bacillus thuringiensis 5-endotoxin is selected from the group consisting of SEQ 
ID NO: 1 , SEQ ID NO: 1 1 , and SEQ ID NO: 1 7. 

56. The plant cell according to claim 55, wherein said Cry2Ab Bacillus thuringiensis 6- 
endotoxin is selected from the group consisting of SEQ ID NO:2, SEQ ID NO: 12, and 
SEQ ID NO: 18. 
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SEQUENCE LISTING 

<110> David R. Corbin 

Charles P. Romano 

<120> Improved Method for Transforming Plants to Express 
del ta - Endotoxins 

<130> 38-21(13547) 

<140> Application No. 09/186,002 
<141> Filed 11/04/98 

<160> 18 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 1934 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Completely Synthesized 
<400> 1 

ccatggacaa ctccgtcctg aactctggtc gcaccaccat ctgcgacgcc tacaacgtcg 
cggcgcatga tccattcagc ttccagcaca agagcctcga cactgttcag aaggagtgga 
cggagtggaa gaagaacaac cacagcctgt acctggaccc catcgtcggc acggtggcca 
gcttccttct caagaaggtc ggctctctcg tcgggaagcg catcctctcg gaactccgca 
acctgatctt tccatctggc tccaccaacc tcatgcaaga catcctcagg gagaccgaga 
agtttctcaa ccagcgcctc aacactgata cccttgctcg cgtcaacgct gagctgacgg 
gtctgcaagc aaacgtggag gagttcaacc gccaagtgga caacttcctc aaccccaacc 
gcaatgcggt gcctctgtcc atcacttctt ccgtgaacac catgcaacaa ctgttcctca 
accgcttgcc tcagttccag atgcaaggct accagctgct cctgctgcca ctctttgctc 
aggctgccaa cctgcacctc tccttcattc gtgacgtgat cctcaacgct gacgagtggg 
gcatctctgc agccacgctg aggacctacc gcgactacct gaagaactac accagggact 
actccaacta ttgcatcaac acctaccagt cggccttcaa gggcctcaat acgaggcttc 
acgacatgct ggagttcagg acctacatgt tcctgaacgt gttcgagtac gtcagcatct 
ggtcgctctt caagtaccag agcctgctgg tgtccagcgg cgccaacctc tacgccagcg 
gctctggtcc ccaacaaact cagagcttca ccagccagga ctggccattc ctgtattcgt 
tgttccaagt caactccaac tacgtcctca acggcttctc tggtgctcgc ctctccaaca 
ccttccccaa cattgttggc ctccccggct ccaccacaac tcatgctctg cttgctgcca 
gagtgaacta ctccggcggc atctcgagcg gcgacattgg tgcatcgccg ttcaaccaga 
acttcaactg ctccaccttc ctgccgccgc tgctcacccc gttcgtgagg tcctggctcg 
acagcggctc cgaccgcgag ggcgtggcca ccgtcaccaa ctggcaaacc gagtccttcg 
agaccaccct tggcctccgg agcggcgcct tcacggcgcg tgggaattct aactacttcc 
ccgactactt catcaggaac atctctggtg ttcctctcgt cgtccgcaac gaggacctcc 
gccgtccact gcactacaac gagatcagga acatcgcctc tccgtccggg acgcccggag 
gtgcaagggc gtacatggtg agcgtccata acaggaagaa caacatccac gctgtgcatg 
agaacggctc catgatccac ctggcgccca atgattacac cggcttcacc atctctccaa 
tccacgccac- ccaagtgaac aaccagacac gcaccttcat ctccgagaag ttcggcaacc 
agggcgactc cctgaggttc gagcagaaca acaccaccgc caggtacacc ctgcgcggca 
acggcaacag ctacaacctg tacctgcgcg tcagctccat tggcaactcc accatcaggg 
tcaccatcaa cgggagggtg tacacagcca ccaatgtgaa cacgacgacc aacaatgatg 
gcgtcaacga caacggcgcc cgcttcagcg acatcaacat tggcaacgtg gtggccagca 
gcaactccga cgtcccgctg gacatcaacg tgaccctgaa ctctggcacc cagttcgacc 
tcatgaacat catgctggtg ccaactaaca tctcgccgct gtactgatag gagctctgat 
ccccatggga attc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1934 



wo 00/26371 



PCT/US99/26086 



<210> 2 
<211> 634 
<212> PRT 

<213> Bacillus thuringiensis 
<400> 2 

Met Asp Asn Ser Val Leu Asn Ser Gly Arg Thr Thr lie Cys Asp Ala 

15 10 15 

Tyr Asn Val Ala Ala His Asp Pro Phe Ser Phe Gin His Lys Ser Leu 

20 25 30 

Asp Thr Val Gin Lys Glu Trp Thr Glu Trp Lys Lys Asn Asn His Ser 

35 40 45 

Leu Tyr Leu Asp Pro lie Val Gly Thr Val Ala Ser Phe Leu Leu Lys 

50 55 60 

Lys Val Gly Ser Leu Val Gly Lys Arg lie Leu Ser Glu Leu Arg Asn 
65 70 75 80 

Leu lie Phe Pro Ser Gly Ser Thr Asn Leu Met Gin Asp lie Leu Arg 

85 90 95 

Glu Thr Glu Lys Phe Leu Asn Gin Arg Leu Asn Thr Asp Thr Leu Ala 

100 105 110 

Arg Val Asn Ala Glu Leu Thr Gly Leu Gin Ala Asn Val Glu Glu Phe 

115 120 125 

Asn Arg Gin Val Asp Asn Phe Leu Asn Pro Asn Arg Asn Ala Val Pro 

130 135 140 

Leu Ser lie Thr Ser Ser Val Asn Thr Met Gin Gin Leu Phe Leu Asn 
145 150 155 160 

Arg Leu Pro Gin Phe Gin Met Gin Gly Tyr Gin Leu Leu Leu Leu Pro 

165 170 175 

Leu Phe Ala Gin Ala Ala Asn Leu His Leu Ser Phe He Arg Asp Val 

180 185 190 

lie Leu Asn Ala Asp Glu Trp Gly He Ser Ala Ala Thr Leu Arg Thr 

195 200 205 

Tyr Arg Asp Tyr Leu Lys Asn Tyr Thr Arg Asp Tyr Ser Asn Tyr Cys 

210 215 220 

He Asn Thr Tyr Gin Ser Ala Phe Lys Gly Leu Asn Thr Arg Leu His 
225 230 235 240 

Asp Met Leu Glu Phe Arg Thr Tyr Met Phe Leu Asn Val Phe Glu Tyr 

245 250 255 

Val Ser He Trp Ser Leu Phe Lys Tyr Gin Ser Leu Leu Val Ser Ser 

260 265 270 

Gly Ala Asn Leu Tyr Ala Ser Gly Ser Gly Pro Gin Gin Thr Gin Ser 

275 280 285 

Phe Thr Ser Gin Asp Trp Pro Phe Leu Tyr Ser Leu Phe Gin Val Asn 

290 295 300 

Ser Asn Tyr Val Leu Asn Gly Phe Ser Gly Ala Arg Leu Ser Asn Thr 
305 310 315 320 

Phe Pro Asn He Val Gly Leu Pro Gly Ser Thr Thr Thr His Ala Leu 

325 330 335 

Leu Ala Ala Arg Val Asn Tyr Ser Gly Gly He. Ser Ser Gly Asp He 

340 345 350 

Gly Ala Ser Pro Phe Asn Gin Asn Phe Asn Cys Ser Thr Phe Leu Pro 

355 360 365 

Pro Leu Leu Thr Pro Phe Val Arg Ser Trp Leu Asp Ser Gly Ser Asp 

370 375 380 

Arg Glu Gly Val Ala Thr Val Thr Asn Trp Gin Thr Glu Ser Phe Glu 
385 390 395 400 

Thr Thr Leu Gly Leu Arg Ser Gly Ala Phe Thr Ala Arg Gly Asn Ser 

405 410 415 

Asn Tyr Phe Pro Asp Tyr Phe He Arg Asn He Ser Gly Val Pro Leu 
420 425 430 
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Val Val Arg Asn 
435 

Arg Asn lie Ala 
450 

Met Val Ser Val 

465 

Asn Gly Ser Met 

lie Ser Pro lie 
500 

lie Ser Glu Lys 
515 

Asn Asn Thr Thr 
530 

Asn Leu Tyr Leu 
545 

Thr lie Asn Gly 

Asn Asn Asp Gly 
580 

lie Gly Asn Val 
595 

Asn Val Thr Leu 
610 

Leu Val Pro Thr 

625 

<210> 3 
<211> 415 
<212> DNA 
<213> Zea mays 



Glu Asp 


Leu 


Arg 






440 


Ser Pro 


Ser 


Gly 




455 




His Asn 


Arg 


Lys 


470 






lie His 


Leu 


Ala 


485 






His Ala 


Thr 


Gin 


Phe Gly 


Asn 


Gin 






520 


Ala Arg 


Tyr 


Thr 




535 




Arg Val 


Ser 


Ser 


550 






Arg Val 


Tvr 


Thr 


565 






Val Asn 


Asp 


Asn 


Val Ala 


Ser 


Ser 






600 


Asn Ser 


Gly 


Thr 




615 




Asn lie 


Ser 


Pro 



630 



Arg 


Pro 


Leu 


His 


Thr 


Pro 


Gly 


Gly 








460 


Asn 


Asn 


He 


His 






475 




Pro 


Asn 


Asp 


Tyr 




490 






Val 


Asn 


Asn 


Gin 


505 








Gly 


Asp 


Ser 


Leu 


Leu 


Arg 


Gly 


Asn 








540 


He 


Gly 


Asn 


Ser 






555 




Ala 


Thr 


Asn 


Val 




570 






Gly 


Ala 


Arg 


Phe 


585 








Asn 


Ser 


Asp 


Val 


Gin 


Phe 


Asp 


Leu 








620 


Leu 


Tyr 







Tyr Asn Glu He 
445 

Ala Arg Ala Tyr 

Ala Val His Glu 
480 

Thr Gly Phe Thr 
495 

Thr Arg Thr Phe 
510 

Arg Phe Glu Gin 

525 

Gly Asn Ser Tyr 

Thr He Arg Val 
560 

Asn Thr Thr Thr 

575 

Ser Asp He Asn 
590 

Pro Leu Asp He 
605 

Met Asn He Met 



<400> 3 

tctagaggat cagcatggcg cccaccgtga tgatggcctc gtcggccacc gccgtcgctc 

cgttcctggg gctcaagtcc accgccagcc tccccgtcgc ccgccgctcc tccagaagcc 

tcggcaacgt cagcaacggc ggaaggatcc ggtgcatgca ggtaacaaat gcatcctagc 

tagtagttct ttgcattgca gcagctgcag ctagcgagtt agtaatagga agggaactga 

tgatccatgc atggactgat gtgtgttgcc catcccatcc catcccattt cccaaacgaa 

ccgaaaacac cgtactacgt gcaggtgtgg ccctacggca acaagaagtt cgagacgctg 

tcgtacctgc cgccgctgtc gaccggcggg cgcatccgct gcatgcaggc catgg 

<210> 4 
<211> 79 

<212> PRT 
<213> Zea mays 

<400> 4 



Met 


Ala 


Pro 


Thr 


Val Met Met Ala Ser Ser Ala 


Thr 


Ala 


Val 


Ala 


Pro 


1 








5 10 








15 




Phe 


Leu 


Gly Leu 


Lys Ser Thr Ala Ser Leu Pro 


Val 


Ala 


Arg 


Arg 


Ser 








20 


25 






30 






Ser 


Arg 


Ser 


Leu 


Gly Asn Val Ser Asn Gly Gly 


Arg 


He 


Arg 


Cys 


Met 






35 




40 




45 








Gin 


Val 


Trp 


Pro 


Tyr Gly Asn Lys Lys Phe Glu 


Thr 


Leu 


Ser 


Tyr 


Leu 




50 






55 


60 










Pro 


Pro 


Leu 


Ser 


Thr Gly Gly Arg He Arg Cys 


Met 


Gin 


Ala 


Met 





65 70 75 



<210> 5 
<211> 268 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<22l> trans it_peptide 
<222> 1-267 

<223> coding sequence for PTPl comprising an Arabidopsis thaliana ssRUBISCO 
(SSU) chloroplast targeting sequence and sequences coding for the first 24 
amino acids of ssRUBISCO (SSU) protein (Wong et al., 1992) 

<400> 5 

atggcttcct ctatgctctc ttccgctact atggttgcct ctccggctca ggccactatg 60 

gtcgctcctt tcaacggact taagtcctcc gctgccttcc cagccacccg caaggctaac 120 

aacgacatta cttccatcac aagcaacggc ggaagagtta actgcatgca ggtgtggcct 180 

ccgattggaa agaagaagtt tgagactctc tcttaccttc ctgaccttac cgattccggt 240 

ggtcgcgtca actgcatgca ggccatgg 268 

<210> 6 
<211> 89 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 6 



Met Ala 


Ser 


Ser Met 


Leu 


Ser 


Ser Ala Thr Met Val 


Ala 


Ser Pro Ala 


1 




5 






10 




15 


Gin Ala 


Thr 


Met Val 


Ala 


Pro 


Phe Asn Gly Leu Lys 


Ser 


Ser Ala Ala 






20 






25 




30 


Phe Pro 


Ala 


Thr Arg 


Lys 


Ala 


Asn Asn Asp He Thr 


Ser 


He Thr Ser 




35 








40 


45 




Asn Gly 


Gly 


Arg Val 


Asn 


Cys 


Met Gin Val Trp Pro 


Pro 


He Gly Lys 


50 








55 


60 






Lys Lys 


Phe 


Glu Thr 


Leu 


Ser 


Tyr Leu Pro Asp Leu Thr Asp Ser Gly 


65 






70 




75 




80 


Gly Arg 


Val 


Asn Cys 


Met 


Gin 


Ala Met 







85 



<210> 7 
<211> 178 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 7 

atggcttcct ctatgctctc ttccgctact atggttgcct ctccggctca ggccactatg 60 
gtcgctcctt tcaacggact taagtcctcc gctgccttcc cagccacccg caaggctaac 120 
aacgacatta cttccatcac aagcaacggc ggaagagtta actgcatgca ggccatgg 178 

<210> 8 
<211> 59 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 8 

Met Ala Ser Ser Met Leu Ser 

1 5 
Gin Ala Thr Met Val Ala Pro 
20 

Phe Pro Ala Thr Arg Lys Ala 

35 

Asn Gly Gly Arg Val Asn Cys 
50 55 



Ser Ala Thr Met Val Ala Ser Pro Ala 

10 15 
Phe Asn Gly Leu Lys Ser Ser Ala Ala 

25 30 
Asn Asn Asp He Thr Ser He Thr Ser 
40 45 
Met Gin Ala Met 



<210> 9 
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<211> 240 
<212> DNA 

<213> Arabidopsis thalicoia 



<400> 9 

atggcgcaag ttagcagaat ctgcaatggt gtgcagaacc catctcttat ctccaatctc 60 

tcgaaatcca gtcaacgcaa atctccctta tcggtttctc tgaagacgca gcagcatcca 120 

cgagcttatc cgatttcgtc gtcgtgggga ttgaagaaga gtgggatgac gttaattggc 180 

tctgagcttc gtcctcttaa ggtcatgtct tctgtttcca cggcgtgcat gcttgccatg 240 

<210> 10 
<211> 80 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 10 


















Met Ala 


Gin Val 


Ser Arg 


He Cys Asn Gly Val 


Gin 


Asn 


Pro 


Ser 


Leu 


1 




5 


10 








15 




lie Ser 


Asn Leu 


Ser Lys 


Ser Ser Gin Arg Lys 


Ser 


Pro 


Leu 


Ser 


Val 




20 




25 






30 






Ser Leu 


Lys Thr 


Gin Gin 


His Pro Arg Ala Tyr 


Pro 


He 


Ser 


Ser 


Ser 




35 




40 




45 








Trp Gly 


Leu Lys Lys Ser Gly Met Thr Leu He Gly Ser Glu Leu Arg 


50 






55 


60 










Pro Leu 


Lys Val 


Met Ser 


Ser Val Ser Thr Ala 


Cys 


Met 


Leu 


Ala 


Met 


65 




70 


75 










80 



<210> 11 
<211> 1907 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> completely synthesized 



<400> 11 

ccatggacaa caacgtcttg aactctggta gaacaaccat ctgcgacgca tacaacgtcg 60 

tggctcacga tccattcagc ttcgaacaca agagcctcga cactattcag aaggagtgga 12 0 

tggaatggaa acgtactgac cactctctct acgtcgcacc tgtggttgga acagtgtcca 180 

gcttccttct caagaaggtc ggctctctca tcggaaaacg tatcttgtcc gaactctggg 240 

gtatcatctt tccatctggg tccactaatc tcatgcaaga catcttgagg gagaccgaac 300 

agtttctcaa ccagcgtctc aacactgata ccttggctag agtcaacgct gagttgatcg 360 

gtctccaagc aaacattcgt gagttcaacc agcaagtgga caacttcttg aatccaactc 420 

agaatcctgt gcctctttcc atcacttctt ccgtgaacac tatgcagcaa ctcttcctca 4 80 

acagattgcc tcagtttcag attcaaggct accagttgct ccttcttcca ctctttgctc 54 0 

aggctgccaa catgcacttg tec t teat ac gtgacgtgat cctcaacgct gacgaatggg 600 

gaatctctgc agccactctt aggacataca gagactactt gaggaactac actcgtgatt 660 

actccaacta ttgcatcaac acttatcaga ctgcctttcg tggactcaat actaggcttc 720 

acgacatgct tgagttcagg acctacatgt tccttaacgt gtttgagtac gtcagcattt 780 

ggagtctctt caagtaccag agcttgatgg tgtcctctgg agccaatctc tacgcctctg 84 0 

gcagtggacc acagcaaact cagagcttca cagctcagaa ctggccattc ttgtatagct 900 

tgttccaagt caactccaac tacattctca gtggtatctc tgggaccaga ctctccataa 960 

eetttcccaa cattggtgga cttccaggct ccactaeaac ccatagcctt aactctgcca 1020 

gagtgaacta cagtggaggt gtcagctctg gattgattgg tgcaactaac ttgaaccaca 1080 

acttcaattg ctccaccgtc ttgccacctc tgagcacacc gtttgtgagg tcctggcttg 1140 

acagcggtac tgatcgcgaa ggagttgcta cctctacaaa ctggcaaacc gagtccttcc 1200 

aaaccactct tagccttcgg tgtggagctt tctctgcacg tgggaattca aactactttc 1260 

cagactactt cattaggaac atctctggtg ttcctctcgt catcaggaat gaagacctca 132 0 

ccegtccact tcattacaac cagattagga acatcgagtc tccatccggt actccaggag 1380 

gtgcaagagc ttacctcgtg tctgtccata acaggaagaa caacatctac gctgceaacg 1440 
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agaatggcac catgattcac cttgcaccag aagattacac tggattcacc atctctccaa 1500 

tccatgctac ccaagtgaac aatcagacac gcaccttcat ctccgaaaag ttcggaaatc 1560 

aaggtgactc cttgaggttc gagcaatcca acactaccgc taggtacact ttgagaggca 1620 

atggaaacag ctacaacctt tacttgagag ttagctccat tggtaactcc accatccgtg 1680 

ttaccatcaa cggacgtgtt tacacagtct ctaatgtgaa cactacaacg aacaatgatg 1740 

gcgttaacga caacggagcc agattcagcg acatcaacat tggcaacatc gtggcctctg 1800 

acaacactaa cgttactttg gacatcaatg tgaccctcaa ttctggaact ccatttgatc 1860 

tcatgaacat catgtttgtg ccaactaacc tccctccatt gtactaa 1907 

<210> 12 
<211> 634 
<212> PRT 

<213> Bacillus thuringiensis 
<400> 12 

Met Asp Asn Asn Val Leu Asn Ser Gly Arg Thr Thr He Cys Asp Ala 

15 10 15 

Tyr Asn Val Val Ala His Asp Pro Phe Ser Phe Glu His Lys Ser Leu 

20 25 30 

Asp Thr He Gin Lys Glu Trp Met Glu Trp Lys Arg Thr Asp His Ser 

35 40 45 

Leu Tyr Val Ala Pro Val Val Gly Thr Val Ser Ser Phe Leu Leu Lys 

50 55 60 

Lys Val Gly Ser Leu He Gly Lys Arg He Leu Ser Glu Leu Trp Gly 
65 70 75 80 

He He Phe Pro Ser Gly Ser Thr Asn Leu Met Gin Asp He Leu Arg 

85 90 95 

Glu Thr Glu Gin Phe Leu Asn Gin Arg Leu Asn Thr Asp Thr Leu Ala 

100 105 110 

Arg Val Asn Ala Glu Leu He Gly Leu Gin Ala Asn He Arg Glu Phe 

115 120 125 

Asn Gin Gin Val Asp Asn Phe Leu Asn Pro Thr Gin Asn Pro Val Pro 

130 135 140 

Leu Ser He Thr Ser Ser Val Asn Thr Met Gin Gin Leu Phe Leu Asn 
145 150 155 160 

Arg Leu Pro Gin Phe Gin He Gin Gly Tyr Gin Leu Leu Leu Leu Pro 

165 170 175 

Leu Phe Ala Gin Ala Ala Asn Met His Leu Ser Phe He Arg Asp Val 

180 185 190 

He Leu Asn Ala Asp Glu Trp Gly He Ser Ala Ala Thr Leu Arg Thr 

195 200 205 

Tyr Arg Asp Tyr Leu Arg Asn Tyr Thr Arg Asp Tyr Ser Asn Tyr Cys 

210 215 220 

He Asn Thr Tyr Gin Thr Ala Phe Arg Gly Leu Asn Thr Arg Leu His 
225 230 235 240 

Asp Met Leu Glu Phe Arg Thr Tyr Met Phe Leu Asn Val Phe Glu Tyr 

245 250 255 

Val Ser He Trp Ser Leu Phe Lys Tyr Gin Ser Leu Met Val Ser Ser 

260 265 270 

Gly Ala Asn Leu Tyr Ala Ser Gly Ser Gly Pro Gin Gin Thr Gin Ser 

275 280 285 

Phe Thr Ala Gin Asn Trp Pro Phe Leu Tyr Ser Leu Phe Gin Val Asn 

290 295 300 

Ser Asn Tyr He Leu Ser Gly He Ser Gly Thr Arg Leu Ser He Thr 
305 310 315 320 

Phe Pro Asn He Gly Gly Leu Pro Gly Ser Thr Thr Thr His Ser Leu 

325 330 335 

Asn Ser Ala Arg Val Asn Tyr Ser Gly Gly Val Ser Ser Gly Leu He 

340 345 350 

Gly Ala Thr Asn Leu Asn His Asn Phe Asn Cys Ser Thr Val Leu Pro 
355 360 365 
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Pro 


Leu 
370 


Ser 


Thr 


Pro 


Phe 


Val 
375 


Arg 


Ser 


Trp 


Leu 


Asp 
380 


Ser 


Gly 


Thr 


Asp 


Arg 


Glu 


Gly Val 


Ala 


Thr 


Ser 


Thr 


Asn 


Trp 


Gin 


Thr 


Glu 


Ser 


Phe 


Gin 


385 










390 










395 










400 


Thr 


Thr 


Leu 


Ser 


Leu Arg 


Cys 


Gly 


Ala 


Phe 


Ser 


Ala 


Arg 


Gly 


Asn 


Ser 










405 










410 










415 




Asn 


Tyr 


Phe 


Pro 
420 


Asp 


Tyr 


Phe 


He 


Arg 

425 


Asn 


He 


Ser 


Gly 


Val 
430 


Pro 


Leu 


Val 


He 


Arg Asn 


Glu Asp 


Leu 


Thr 


Arg 


Pro 


Leu 


His 


Tyr 


Asn 


Gin 


He 






435 










440 










445 








Arg 


Asn 
450 


He 


Glu 


Ser 


Pro 


Ser 
455 


Gly 


Thr 


Pro 


Gly 


Gly 
460 


Ala 


Arg 


Ala 


Tyr 


Leu 


Val 


Ser 


Val 


His 


Asn 


Arg 


Lys 


Asn 


Asn 


He 


Tyr 


Ala 


Ala 


Asn 


Glu 


465 










470 










475 










480 


Asn 


Gly 


Thr 


Met 


He 
485 


His 


Leu 


Ala 


Pro 


Glu 
490 


Asp 


Tyr 


Thr 


Gly 


Phe 
495 


Thr 


He 


Ser 


Pro 


He 

500 


His 


Ala 


Thr 


Gin 


Val 

505 


Asn 


Asn 


Gin 


Thr 


Arg 
510 


Thr 


Phe 


He 


Ser 


Glu 
515 


Lys 


Phe 


Gly 


Asn 


Gin 
520 


Gly 


^sp 


Ser 


Leu 


Arg 
525 


Phe 


Glu 


Gin 


Ser 


Asn 
530 


Thr 


Thr 


Ala 


Arg 


Tyr 
535 


Thr 


Leu 


Arg 


Gly 


Asn 
540 


Gly 


Asn 


Ser 


Tyr 


Asn 


Leu 


Tyr Leu 


Arg 


Val 


Ser 


Ser 


He 


Gly 


Asn 


Ser 


Thr 


He 


Arg 


Val 


545 










550 










555 










560 


Thr 


He 


Asn Gly 


Arg 


Val 


Tyr 


Thr 


Val 


Ser 


Asn 


Val 


Asn 


Thr 


Thr 


Thr 










565 










570 










575 




Asn 


Asn 


Asp Gly 


Val 


Asn 


Asp 


Asn 


Gly 


Ala 


Arg 


Phe 


Ser 


Asp 


He 


Ash 








580 










585 










590 






He 


Gly 


Asn 
595 


He 


Val 


Ala 


Ser 


Asp 
600 


Asn 


Thr 


Asn 


Val 


Thr 
605 


Leu 


Asp 


He 


Asn 


Val 
610 


Thr 


Leu 


Asn 


Ser 


Gly 

615 


Thr 


Pro 


Phe 


Asp 


Leu 
620 


Met 


Asn 


He 


Met 


Phe 


Val 


Pro 


Thr 


Asn 


Leu 


Pro 


Pro 


Leu 


Tyr 















625 630 



<210> 13 
<211> 10339 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> unsure 

<222> 3687-3760; 4382-4434; 
<223> "n" s g, a, c, or t 



<400> 13 

ggccgcgtta actgcaggtc cgatgtgaga cttttcaaca aagggtaata tccggaaacc 60 

tcctcggatt ccattgccca gctatctgtc actttattgt gaagatagtg gaaaaggaag 120 

gtggctccta caaatgccat cattgcgata aaggaaaggc catcgttgaa gatgcctctg 180 

ccgacagtgg tcccaaagat ggacccccac ccacgaggag catcgtggaa aaagaagacg 240 

ttccaaccac gtcttcaaag caagtggatt gatgtgatgg tccgatgtga gacttttcaa 300 

caaagggtaa tatccggaaa cctcctcgga ttccattgcc cagctatctg tcactttatt 36 0 

gtgaagatag tggaaaagga aggtggctcc tacaaatgcc atcattgcga taaaggaaag 42 0 

gccatcgttg aagatgcctc tgccgacagt ggtcccaaag atggaccccc acccacgagg 480 

agcatcgtgg aaaaagaaga cgttccaacc acgtcttcaa agcaagtgga ttgatgtgat 54 0 

atctccactg acgtaaggga tgacgcacaa tcccactatc cttcgcaaga cccttcctct 600 

atataaggaa gttcatttca tttggagagg acacagaaaa atttgctaca ttgtttcaca 660 

aacttcaaat attattcatt tatttgtcag ctttcaaact ctttgtttct tgtttgttga 720 

ttgagaatac aatggcttcc tctatgctct cttccgctac tatggttgcc tctccggctc 780 

aggccactat ggtcgctcct ttcaacggac ttaagtcctc cgctgccttc ccagccaccc 84 0 

gcaaggctaa caacgacatt acttccatca caagcaacgg cggaagagtt aactgcatgc 900 
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^99tgtggcc tccgattgga aagaagaagt ttgagactct ctcttacctt cctgacctta 960 

ccgattccgg tggtcgcgtc aactgcatgc aggccatgga caactccgtc ctgaactctg 1020 

gtcgcaccac catctgcgac gcctacaacg tcgcggcgca tgatccattc agcttccagc 1080 

acaagagcct cgacactgtt cagaaggagt ggacggagtg gaagaagaac aaccacagcc 1140 

tgtacctgga ccccatcgtc ggcacggtgg ccagcttcct tctcaagaag gtcggctctc 1200 

tcgtcgggaa gcgcatcctc tcggaactcc gcaacctgat ctttccatct ggctccacca 1260 

acctcatgca agacatcctc agggagaccg agaagtttct caaccagcgc ctcaacactg 1320 

atacccttgc tcgcgtcaac gctgagctga cgggtctgca agcaaacgtg gaggagttca 1380 

accgccaagt ggacaacttc ctcaacccca accgcaatgc ggtgcctctg tccatcactt 1440 

cttccgtgaa caeca tgcaa caactgttcc tcaaccgctt gcctcagttc cagatgcaag 1500 

gctaccagct gctcctgctg ccactctttg ctcaggctgc caacctgcac ctctccttca 156 0 

ttcgtgacgt gatcctcaac gctgacgagt ggggcatctc tgcagccacg ctgaggacct 1620 

accgcgacta cctgaagaac tacaccaggg actactccaa ctattgcatc aacacctacc 1680 

agtcggcctt caagggcctc aatacgaggc ttcacgacat gctggagttc aggacctaca 174 0 

tgttcctgaa cgtgttcgag tacgtcagca tctggtcgct cttcaagtac cagagcctgc 1800 

tggtgtccag cggcgccaac ctctacgcca gcggctctgg tccccaacaa actcagagct 1860 

tcaccagcca ggactggcca ttcctgtatt cgttgttcca agtcaactcc aactacgtcc 1920 

tcaacggctt ctctggtgct cgcctctcca acaccttccc caacattgtt ggcctccccg 1980 

gctccaccac aactcatgct ctgcttgctg ccagagtgaa ctactccggc ggcatctcga 2 040 

gcggcgacat tggtgcatcg ccgttcaacc agaacttcaa ctgctccacc ttcctgccgc 2100 

cgctgctcac cccgttcgtg aggtcctggc tcgacagcgg ctccgaccgc gagggcgtgg 2160 

ccaccgtcac caactggcaa accgagtcct tcgagaccac ccttggcctc cggagcggcg 2220 

ccttcacggc gcgtgggaat tctaactact tccccgacta cttcatcagg aacatctctg 2280 

gtgttcctct cgtcgtccgc aacgaggacc tccgccgtcc actgcactac aacgagatca 234 0 

ggaacatcgc ctctccgtcc gggacgcccg gaggtgcaag ggcgtacatg gtgagcgtcc 2400 

ataacaggaa gaacaacatc cacgctgtgc atgagaacgg ctccatgatc cacctggcgc 2460 

ccaatgatta caccggcttc accatctctc caatccacgc cacccaagtg aacaaccaga 2520 

cacgcacctt catctccgag aagttcggca accagggcga ctccctgagg ttcgagcaga 2580 

acaacaccac cgccaggtac accctgcgcg gcaacggcaa cagctacaac ctgtacctgc 264 0 

gcgtcagctc cattggcaac tccaccatca gggtcaccat caacgggagg gtgtacacag 2700 

ccaccaatgt gaacacgacg accaacaatg atggcgtcaa cgacaacggc gcccgcttca 2760 

gcgacatcaa cattggcaac gtggtggcca gcagcaactc cgacgtcccg ctggacatca 2820 

acgtgaccct gaactctggc acccagttcg acctcatgaa catcatgctg gtgccaacta 2880 

acatctcgcc gctgtactga taggagctct gatccccatg ggaattcccg atcgttcaaa 294 0 

catttggcaa taaagtttct taagattgaa tcctgttgcc ggtcttgcga tgattatcat 3000 

ataatttctg ttgaattacg ttaagcatgt aataattaac atgtaatgca tgacgttatt 3060 

tatgagatgg gtttttatga ttagagtccc gcaattatac atttaatacg cgatagaaaa 312 0 

caaaatatag cgcgcaaact aggataaatt atcgcgcgcg gtgtcatcta tgttactaga 3180 

tcggggatat ccccggggcg gccgctcgag tggtggccgc atcgatcgtg aagtttctca 3240 

tctaagcccc catttggacg tgaatgtaga cacgtcgaaa taaagatttc cgaattagaa 3300 

taatttgttt attgctttcg cctataaata cgacggatcg taatttgtcg ttttatcaaa 3360 

atgtactttc attttataat aacgctgcgg acatctacat ttttgaattg aaaaaaaatt 3420 

ggtaattact ctttcttttt ctccatattg accatcatac tcattgctga tccatgtaga 3480 

tttcccggac atgaagccat ttacaattga atatatcctg ccgccgctgc cgctttgcac 3540 

ccggtggagc ttgcatgttg gtttctacgc agaactgagc cggttaggca gataatttcc 3600 

attgagaact gagccatgtg caccttcccc ccaacacggt gagcgacggg gcaacggagt 3660 

gatccacatg ggacttttcc tagcttnnnn nnnnnnniinn nnnnnnnnnn nnnnnnnnnn 3720 

nnnnnnimnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ccgggagggt tcgagaaggg 3780 

ggggcacccc ccttcggcgt gcgcggtcac gcgccagggc gcagccctgg ttaaaaacaa 3840 

ggtttataaa tattggttta aaagcaggtt aaaagacagg ttagcggtgg ccgaaaaacg 3900 

ggcggaaacc cttgcaaatg ctggattttc tgcctgtgga cagcccctca aatgtcaata 3960 

ggtgcgcccc tcatctgtca tcactctgcc cctcaagtgt caaggatcgc gcccctcatc 402 0 

tgtcagtagt cgcgcccctc aagtgtcaat accgcagggc acttatcccc aggcttgtcc 408 0 

acatcatctg tgggaaactc gcgtaaaatc aggcgttttc gccgatttgc gaggctggcc 4140 

agctccacgt cgccggccga aatcgagcct gcccctcatc tgtcaacgcc gcgccgggtg 4200 

agtcggcccc tcaagtgtca acgtccgccc ctcatctgtc agtgagggcc aagttttccg 4260 

cgtggtatcc acaacgccgg cggccggccg cggtgtctcg cacacggctt cgacggcgtt 4 32 0 

tctggcgcgt ttgcagggcc atagacggcc gccagcccag cggcgagggc aaccagcccg 4 380 

gnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnngtcgat 444 0 

cgaccgatgc ccttgagagc cttcaaccca gtcagctcct tccggtgggc gcggggcatg 4500 

actatcgtcg ccgcacttat gactgtcttc tttatcatgc aactcgtagg acaggtgccg 4 56 0 
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gcagcgctct gggtcatttt cggcgaggac cgctttcgct ggagcgcgac gatgatcggc 4620 

ctgtcgcttg cggtattcgg aatcttgcac gccctcgctc aagccttcgt cactggtccc 4 680 

gccaccaaac gtttcggcga gaagcaggcc attatcgccg gcatggcggc cgacgcgctg 4740 

ggctacgtct tgctggcgtt cgcgacgcga ggctggatgg ccttccccat tatgattctt 4800 

ctcgcttccg gcggcatcgg gatgcccgcg ttgcaggcca tgctgtccag gcaggtagat 4860 

gacgaccatc agggacagct tcaaggatcg ctcgcggctc ttaccagcct aacttcgatc 4 92 0 

actggaccgc tgatcgtcac ggcgatttat gccgcctcgg cgagcacatg gaacgggttg 4 980 

gcatggattg taggcgccgc cctatacctt gtctgcctcc ccgcgttgcg tcgcggtgca 5040 

tggagccggg ccacctcgac ctgaatggaa gccggcggca cctcgctaac ggattcacca 5100 

ctccaagaat tggagccaat caattcttgc ggagaactgt gaatgcgcaa accaaccctt 5160 

ggcagaacat atccatcgcg tccgccatct ccagcagccg cacgcggcgc atctcgggca 5220 

gcgttgggtc ctggccacgg gtgcgcatga tcgtgctcct gtcgttgagg acccggctag 5280 

gctggcgggg ttgccttact ggttagcaga atgaatcacc gatacgcgag cgaacgtgaa 5340 

gcgactgctg ctgcaaaacg tctgcgacct gagcaacaac atgaatggtc ttcggtttcc 5400 

gtgtttcgta aagtctggaa acgcggaagt cagcgccctg caccattatg ttccggatct 5460 

gcatcgcagg atgctgctgg ctaccctgtg gaacacctac atctgtatta acgaagcgct 5520 

ggcattgacc ctgagtgatt tttctctggt cccgccgcat ccataccgcc agttgtttac 5580 

cctcacaacg ttccagtaac cgggcatgtt catcatcagt aacccgtatc gtgagcatcc 5640 

tctctcgttt catcggtatc attaccccca tgaacagaaa ttccccctta cacggaggca 5700 

tcaagtgacc aaacaggaaa aaaccgccct taacatggcc cgctttatca gaagccagac 576 0 

attaacgctt ctggagaaac tcaacgagct ggacgcggat gaacaggcag acatctgtga 5820 

atcgcttcac gaccacgctg atgagcttta ccgcagctgc ctcgcgcgtt tcggtgatga 5880 

cggtgaaaac ctctgacaca tgcagctccc ggagacggtc acagcttgtc tgtaagcgga 5940 

tgccgggagc agacaagccc gtcagggcgc gtcagcgggt gttggcgggt gtcggggcgc 6000 

agccatgacc cagtcacgta gcgatagcgg agtgtatact ggcttaacta tgcggcatca 6 060 

gagcagattg tactgagagt gcaccatatg cggtgtgaaa taccgcacag atgcgtaagg 6120 

agaaaatacc gcatcaggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc 6180 

gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa 6240 

tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt 6300 

aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa 6360 

aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt 6420 

ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg 6480 

tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc 6540 

agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc 6600 

gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta 6660 

tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct 6720 

acagagttct tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc 6780 

tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa 6840 

caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa 6900 

aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa 6960 

aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt 7 020 

ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac 7080 

agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc 7140 

atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt accatctggc 7200 

cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata 7260 

aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc 7320 

cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc 7380 

aacgttgttg ccattgctgc aggtcgggag cacaggatga cgcctaacaa ttcattcaag 7440 

ccgacaccgc ttcgcggcgc ggcttaattc aggagttaaa catcatgagg gaagcggtga 7500 

tcgccgaagt atcgactcaa ctatcagagg tagttggcgt catcgagcgc catctcgaac 7560 

cgacgttgct ggccgtacat ttgtacggct ccgcagtgga tggcggcctg aagccacaca 7620 

gtgatattga tttgctggtt acggtgaccg taaggcttga tgaaacaacg cggcgagctt 7680 

tgatcaacga ccttttggaa acttcggctt cccctggaga gagcgagatt ctccgcgctg 774 0 

tagaagtcac cattgttgtg cacgacgaca tcattccgtg gcgttatcca gctaagcgcg 7800 

aactgcaatt tggagaatgg cagcgcaatg acattcttgc aggtatcttc gagccagcca 7860 

cgatcgacat tgatctggct atcttgctga caaaagcaag agaacatagc gttgccttgg 7920 

taggtccagc ggcggaggaa ctctttgatc cggttcctga acaggatcta tttgaggcgc 7980 

taaatgaaac cttaacgcta tggaactcgc cgcccgactg ggctggcgat gagcgaaatg 804 0 

tagtgcttac gttgtcccgc atttggtaca gcgcagtaac cggcaaaatc gcgccgaagg 8100 

atgtcgctga agactgggca atggagcgcc tgccggccca gtatcagccc gtcatacttg 8160 

aagctaggca ggcttatctt ggacaagaag atcgcttggc ctcgcgcgca gatcagttgg 8220 
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aagaatttgt tcactacgtg aaaggcgaga tcaccaaggt agtcggcaaa taatgtctaa 828 0 

caattcgttc aagccgacgc cgcttcgcgg cgcggcttaa ctcaagcgtt agatgctgca 834 0 

ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga 8400 

tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct 8460 

ccgatcgagg atttttcggc gctgcgctac gtccgckacc gcgttgaggg atcaagccac 8520 

agcagcccac tcgacctcta gccgacccag acgagccaag ggatcttttt ggaatgctgc 8580 

tccgtcgtca ggctttccga cgtttgggtg gttgaacaga agtcattatc gtacggaatg 8640 

ccaagcactc ccgaggggaa ccctgtggtt ggcatgcaca tacaaatgga cgaacggata 8700 

aaccttttca cgccctttta aatatccgtt attctaataa acgctctttt ctcttaggtt 8760 

tacccgccaa tatatcctgt caaacactga tagtttaaac tgaaggcggg aaacgacaat 8820 

ctgatcccca tcaagcttgg tcgagtggaa gctagcttcc cgatcctatc tgtcacttca 8880 

tcaaaaggac agtagaaaag gaaggtggca ctacaaatgc catcattgcg ataaaggaaa 8940 

ggctatcgtt caagatgcct ctgccgacag tggtcccaaa gatggacccc cacccacgag 9000 

gagcatcgtg gaaaaagaag acgttccaac cacgtcttca aagcaagtgg attgatgtga 9060 

tacttccact gacgtaaggg atgacgcaca atcccactat ccttcgcaag acccttcctc 9120 

tatataagga agttcatttc atttggagag gacacgctga aatcaccagt ctctctctac 9180 

aagatcgggg atctctagct agacgatcgt ttcgcatgat tgaacaagat ggattgcacg 9240 

caggttctcc ggccgcttgg gtggagaggc tattcggcta tgactgggca caacagacaa 9300 

tcggctgctc tgatgccgcc gtgttccggc tgtcagcgca ggggcgcccg gttctttttg 9360 

tcaagaccga cctgtccggt gccctgaatg aactgcagga cgaggcagcg cggctatcgt 9420 

ggctggccac gacgggcgtt ccttgcgcag ctgtgctcga cgttgtcact gaagcgggaa 94 80 

gggactggct gctattgggc gaagtgccgg ggcaggatct cctgtcatct caccttgctc 9540 

ctgccgagaa agtatccatc atggctgatg caatgcggcg gctgcatacg cttgatccgg 9600 

ctacctgccc attcgaccac caagcgaaac atcgcatcga gcgagcacgt actcggatgg 9660 

aagccggtct tgtcgatcag gatgatctgg acgaagagca tcaggggctc gcgccagccg 9720 

aactgttcgc caggctcaag gcgcgcatgc ccgacggcga ggatctcgtc gtgacccatg 97 80 

gcgatgcctg cttgccgaat atcatggtgg aaaatggccg cttttctgga ttcatcgact 9840 

gtggccggct gggtgtggcg gaccgctatc aggacatagc gttggctacc cgtgatattg 9900 

ctgaagagct tggcggcgaa tgggctgacc gcttcctcgt gctttacggt atcgccgctc 9960 

ccgattcgca gcgcatcgcc ttctatcgcc ttcttgacga gttcttctga gcgggactct 10020 

ggggttcgat ccccaattcc cgatcgttca aacatttggc aataaagttt cttaagattg 10080 

aatcctgttg ccggtcttgc gatgattatc atataatttc tgttgaatta cgttaagcat 10140 

gtaataatta acatgtaatg catgacgtta tttatgagat gggtttttat gattagagtc 102 00 

ccgcaattat acatttaata cgcgatagaa aacaaaatat agcgcgcaaa ctaggataaa 10260 

ttatcgcgcg cggtgtcatc tatgttacta gatcggggat cgggccactc gaccaagctt 10320 

ctgcaggtcc tgctcgagc 10339 



<210> 14 
<211> 10249 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> unsure 

<222> 3597-3670; 4292-4344; 
<223> "n" = g, a, c, or t 



<400> 14 

ggccgcgtta actgcaggtc cgatgtgaga cttttcaaca aagggtaata tccggaaacc 60 

tcctcggatt ccattgccca gctatctgtc actttattgt gaagatagtg gaaaaggaag 120 

gtggctccta caaatgccat cattgcgata aaggaaaggc catcgttgaa gatgcctctg 180 

ccgacagtgg tcccaaagat ggacccccac ccacgaggag catcgtggaa aaagaagacg 240 

ttccaaccac gtcttcaaag caagtggatt gatgtgatgg tccgatgtga gacttttcaa 300 

caaagggtaa tatccggaaa cctcctcgga ttccattgcc cagctatctg tcactttatt 360 

gtgaagatag tggaaaagga aggtggctcc tacaaatgcc atcattgcga taaaggaaag 420 

gccatcgttg aagatgcctc tgccgacagt ggtcccaaag atggaccccc acccacgagg 4 80 

agcatcgtgg aaaaagaaga cgttccaacc acgtcttcaa agcaagtgga ttgatgtgat 54 0 

atctccactg acgtaaggga tgacgcacaa tcccactatc cttcgcaaga cccttcctct 6 00 

atataaggaa gttcatttca tttggagagg acacagaaaa atttgctaca ttgtttcaca 660 
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aacttcaaat attattcatt tatttgtcag ctttcaaact ctttgtttct tgtttgttga 720 

ttgagaatac aatggcttcc tctatgctct cttccgctac tatggttgcc tctccggctc 780 

aggccactat ggtcgctcct ttcaacggac ttaagtcctc cgctgccttc ccagccaccc 840 

gcaaggctaa caacgacatt acttccatca caagcaacgg cggaagagtt aactgcatgc 900 

aggccatgga caactccgtc ctgaactctg gtcgcaccac catctgcgac gcctacaacg 96 0 

tcgcggcgca tgatccattc agcttccagc acaagagcct cgacactgtt cagaaggagt 1020 

ggacggagtg gaagaagaac aaccacagcc tgtacctgga ccccatcgtc ggcacggtgg 1080 

ccagcttcct tctcaagaag gtcggctctc tcgtcgggaa gcgcatcctc tcggaactcc 1140 

gcaacctgat ctttccatct ggctccacca acctcatgca agacatcctc agggagaccg 1200 

agaagtttct caaccagcgc ctcaacactg atacccttgc tcgcgtcaac gctgagctga 1260 

cgggtctgca agcaaacgtg gaggagttca accgccaagt ggacaacttc ctcaacccca 1320 

accgcaatgc ggtgcctctg tccatcactt cttccgtgaa caccatgcaa caactgttcc 1380 

tcaaccgctt gcctcagttc cagatgcaag gctaccagct gctcctgctg ccactctttg 1440 

ctcaggctgc caacctgcac ctctccttca ttcgtgacgt gatcctcaac gctgacgagt 1500 

Mggcatctc tgcagccacg ctgaggacct accgcgacta cctgaagaac tacaccaggg 1560 

actactccaa ctattgcatc aacacctacc agtcggcctt caagggcctc aatacgaggc 1620 

ttcacgacat gctggagttc aggacctaca tgttcctgaa cgtgttcgag tacgtcagca 1680 

tctggtcgct cttcaagtac cagagcctgc tggtgtccag cggcgccaac ctctacgcca 1740 

gcggctctgg tccccaacaa actcagagct tcaccagcca ggactggcca ttcctgtatt 1800 

cgttgttcca agtcaactcc aactacgtcc tcaacggctt ctctggtgct cgcctctcca 1860 

acaccttccc caacattgtt ggcctccccg gctccaccac aactcatgct ctgcttgctg 1920 

ccagagtgaa ctactccggc ggcatctcga gcggcgacat tggtgcatcg ccgttcaacc 1980 

agaacttcaa ctgctccacc ttcctgccgc cgctgctcac cccgttcgtg aggtcctggc 2040 

tcgacagcgg ctccgaccgc gagggcgtgg ccaccgtcac caactggcaa accgagtcct 2100 

tcgagaccac ccttggcctc cggagcggcg ccttcacggc gcgtgggaat tctaactact 2160 

tccccgacta cttcatcagg aacatctctg gtgttcctct cgtcgtccgc aacgaggacc 2220 

tccgccgtcc actgcactac aacgagatca ggaacatcgc ctctccgtcc gggacgcccg 2280 

gaggtgcaag ggcgtacatg gtgagcgtcc ataacaggaa gaacaacatc cacgctgtgc 2340 

atgagaacgg ctccatgatc cacctggcgc ccaatgatta caccggcttc accatctctc 2400 

caatccacgc cacccaagtg aacaaccaga cacgcacctt catctccgag aagttcggca 2460 

accagggcga ctccctgagg ttcgagcaga acaacaccac cgccaggtac accctgcgcg 2520 

gcaacggcaa cagctacaac ctgtacctgc gcgtcagctc cattggcaac tccaccatca 2580 

gggtcaccat caacgggagg gtgtacacag ccaccaatgt gaacacgacg accaacaatg 264 0 

atggcgtcaa cgacaacggc gcccgcttca gcgacatcaa cattggcaac gtggtggcca 2 700 

gcagcaactc cgacgtcccg ctggacatca acgtgaccct gaactctggc acccagttcg 2760 

acctcatgaa catcatgctg gtgccaacta acatctcgcc gctgtactga taggagctct 2820 

gatccccatg ggaattcccg atcgttcaaa catttggcaa taaagtttct taagattgaa 2880 

tcctgttgcc ggtcttgcga tgattatcat ataatttctg ttgaattacg ttaagcatgt 2940 

aataattaac atgtaatgca tgacgttatt tatgagatgg gtttttatga ttagagtccc 3000 

gcaattatac atttaatacg cgatagaaaa caaaatatag cgcgcaaact aggataaatt 3060 

atcgcgcgcg gtgtcatcta tgttactaga tcggggatat ccccggggcg gccgctcgag 3120 

tggtggccgc atcgatcgtg aagtttctca tctaagcccc catttggacg tgaatgtaga 3180 

cacgtcgaaa taaagatttc cgaattagaa taatttgttt attgctttcg cctataaata 3240 

cgacggatcg taatttgtcg ttttatcaaa atgtactttc attttataat aacgctgcgg 33 00 

acatctacat ttttgaattg aaaaaaaatt ggtaattact ctttcttttt ctccatattg 3360 

accatcatac tcattgctga tccatgtaga tttcccggac atgaagccat ttacaattga 342 0 

atatatcctg ccgccgctgc cgctttgcac ccggtggagc ttgcatgttg gtttctacgc 3480 

agaactgagc cggttaggca gataatttcc attgagaact gagccatgtg caccttcccc 3540 

ccaacacggt gagcgacggg gcaacggagt gatccacatg ggacttttcc tagcttnnnn 3600, 

imnnnnnnnn nnnnimnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn imnnnnnnnn 366 0 

imnnnnnnnn ccgggagggt tcgagaaggg ggggcacccc ccttcggcgt gcgcggtcac 372 0 

gcgccagggc gcagccctgg ttaaaaacaa ggtttataaa tattggttta aaagcaggtt 3780 

aaaagacagg ttagcggtgg ccgaaaaacg ggcggaaacc cttgcaaatg ctggattttc 384 0 

tgcctgtgga cagcccctca aatgtcaata ggtgcgcccc tcatctgtca tcactctgcc 3900 

cctcaagtgt caaggatcgc gcccctcatc tgtcagtagt cgcgcccctc aagtgtcaat 3960 

accgcagggc acttatcccc aggcttgtcc acatcatctg tgggaaactc gcgtaaaatc 4020 

aggcgttttc gccgatttgc gaggctggcc agctccacgt cgccggccga aatcgagcct 4 08 0 

gcccctcatc tgtcaacgcc gcgccgggtg agtcggcccc tcaagtgtca acgtccgccc 414 0 

ctcatctgtc agtgagggcc aagttttccg cgtggtatcc acaacgccgg cggccggccg 4200 

cggtgtctcg cacacggctt cgacggcgtt tctggcgcgt ttgcagggcc atagacggcc 4260 

gccagcccag cggcgagggc aaccagcccg gnnnnimrmn nnnnnnnnnn nnnnnnnnnn 4320 
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nnnnnnnnnn nnnnnnnnnn nnnngtcgat cgaccgatgc ccttgagagc cttcaaccca 4380 

gtcagctcct tccggtgggc gcggggcatg actatcgtcg ccgcacttat gactgtcttc 444 0 

tttatcatgc aactcgtagg acaggtgccg gcagcgctct gggtcatttt cggcgaggac 4500 

cgctttcgct ggagcgcgac gatgatcggc ctgtcgcttg cggtattcgg aatcttgcac 4560 

gccctcgctc aagccttcgt cactggtccc gccaccaaac gtttcggcga gaagcaggcc 4620 

attatcgccg gcatggcggc cgacgcgctg ggctacgtct tgctggcgtt cgcgacgcga 46 80 

ggctggatgg ccttccccat tatgattctt ctcgcttccg gcggcatcgg gatgcccgcg 4740 

ttgcaggcca tgctgtccag gcaggtagat gacgaccatc agggacagct tcaaggatcg 4800 

ctcgcggctc ttaccagcct aacttcgatc actggaccgc tgatcgtcac ggcgatttat 4860 

gccgcctcgg cgagcacatg gaacgggttg gcatggattg taggcgccgc cctatacctt 4920 

gtctgcctcc ccgcgttgcg tcgcggtgca tggagccggg ccacctcgac ctgaatggaa 4980 

gccggcggca cctcgctaac ggattcacca ctccaagaat tggagccaat caattcttgc 5040 

ggagaactgt gaatgcgcaa accaaccctt ggcagaacat atccatcgcg tccgccatct 5100 

ccagcagccg cacgcggcgc atctcgggca gcgttgggtc ctggccacgg gtgcgcatga 5160 

tcgtgctcct gtcgttgagg acccggctag gctggcgggg ttgccttact ggttagcaga 5220 

atgaatcacc gatacgcgag cgaacgtgaa gcgactgctg ctgcaaaacg tctgcgacct 5280 

gagcaacaac atgaatggtc ttcggtttcc gtgtttcgta aagtctggaa acgcggaagt 5340 

cagcgccctg caccattatg ttccggatct gcatcgcagg atgctgctgg ctaccctgtg 54 00 

gaacacctac atctgtatta acgaagcgct ggcattgacc ctgagtgatt tttctctggt 5460 

cccgccgcat ccataccgcc agttgtttac cctcacaacg ttccagtaac cgggcatgtt 5520 

catcatcagt aacccgtatc gtgagcatcc tctctcgttt catcggtatc attaccccca 5580 

tgaacagaaa ttccccctta cacggaggca tcaagtgacc aaacaggaaa aaaccgccct 5640 

taacatggcc cgctttatca gaagccagac attaacgctt ctggagaaac tcaacgagct 5700 

ggacgcggat gaacaggcag acatctgtga atcgcttcac gaccacgctg atgagcttta 5760 

ccgcagctgc ctcgcgcgtt tcggtgatga cggtgaaaac ctctgacaca tgcagctccc 5820 

ggagacggtc acagcttgtc tgtaagcgga tgccgggagc agacaagccc gtcagggcgc 5880 

gtcagcgggt gttggcgggt gtcggggcgc agccatgacc cagtcacgta gcgatagcgg 594 0 

agtgtatact ggcttaacta tgcggcatca gagcagattg tactgagagt gcaccatatg 6000 

cggtgtgaaa taccgcacag atgcgtaagg agaaaatacc gcatcaggcg ctcttccgct 6060 

tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac 6120 

tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga 6180 

gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat 624 0 

aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac 6300 

ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct 6360 

gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg 6420 

ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg 6480 

ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt 6540 

cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg 66 00 

attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac 6660 

ggctacacta gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga 6720 

aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt 6780 

gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt 6840 

tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga 6900 

ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc 6960 

taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct 7020 

atctcagcga tctgtctatt tcgttcatcc atagttgcct gactccccgt cgtgtagata 7080 

actacgatac gggagggctt accatctggc cccagtgctg caatgatacc gcgagaccca 7140 

cgctcaccgg ctccagattt atcagcaata aaccagccag ccggaagggc cgagcgcaga 7200 

agtggtcctg caactttatc cgcctccatc cagtctatta attgttgccg ggaagctaga 7260 

gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg ccattgctgc aggtcgggag 7320 

cacaggatga cgcctaacaa ttcattcaag ccgacaccgc ttcgcggcgc ggcttaattc 7380 

a99^gttaaa catcatgagg gaagcggtga tcgccgaagt atcgactcaa ctatcagagg 744 0 

tagttggcgt catcgagcgc catctcgaac cgacgttgct ggccgtacat ttgtacggct 7500 

ccgcagtgga tggcggcctg aagccacaca gtgatattga tttgctggtt acggtgaccg 7560 

taaggcttga tgaaacaacg cggcgagctt tgatcaacga ccttttggaa acttcggctt 7620 

cccctggaga gagcgagatt ctccgcgctg tagaagtcac cattgttgtg cacgacgaca 7680 

tcattccgtg gcgttatcca gctaagcgcg aactgcaatt tggagaatgg cagcgcaatg 7740 

acattcttgc aggtatcttc gagccagcca cgatcgacat tgatctggct atcttgctga 7800 

caaaagcaag agaacatagc gttgccttgg taggtccagc ggcggaggaa ctctttgatc 7860 

cggttcctga acaggatcta tttgaggcgc taaatgaaac cttaacgcta tggaactcgc 7920 

cgcccgactg ggctggcgat gagcgaaatg tagtgcttac gttgtcccgc atttggtaca 7980 
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gcgcagtaac cggcaaaatc gcgccgaagg atgtcgctga agactgggca atggagcgcc 804 0 

tgccggccca gtatcagccc gtcatacttg aagctaggca ggcttatctt ggacaagaag 8100 

atcgcttggc ctcgcgcgca gatcagttgg aagaatttgt tcactacgtg aaaggcgaga 8160 

tcaccaaggt agtcggcaaa taatgtctaa caattcgttc aagccgacgc cgcttcgcgg 8220 

cgcggcttaa ctcaagcgtt agatgctgca ggcatcgtgg tgtcacgctc gtcgtttggt 8280 

atggcttcat tcagctccgg ttcccaacga tcaaggcgag ttacatgatc ccccatgttg 8340 

tgcaaaaaag cggttagctc cttcggtcct ccgatcgagg atttttcggc gctgcgctac 8400 

gtccgckacc gcgttgaggg atcaagccac agcagcccac tcgacctcta gccgacccag 8460 

acgagccaag ggatcttttt ggaatgctgc tccgtcgtca ggctttccga cgtttgggtg 8520 

gttgaacaga agtcattatc gtacggaatg ccaagcactc ccgaggggaa ccctgtggtt 8580 

ggcatgcaca tacaaatgga cgaacggata aaccttttca cgccctttta aatatccgtt 8640 

attctaataa acgctctttt ctcttaggtt tacccgccaa tatatcctgt caaacactga 8700 

tagtttaaac tgaaggcggg aaacgacaat ctgatcccca tcaagcttgg tcgagtggaa 8760 

gctagcttcc cgatcctatc tgtcacttca tcaaaaggac agtagaaaag gaaggtggca 882 0 

ctacaaatgc catcattgcg ataaaggaaa ggctatcgtt caagatgcct ctgccgacag 8880 

tggtcccaaa gatggacccc cacccacgag gagcatcgtg gaaaaagaag acgttccaac 8940 

cacgtcttca aagcaagtgg attgatgtga tacttccact gacgtaaggg atgacgcaca 9000 

atcccactat ccttcgcaag acccttcctc tatataagga agttcatttc atttggagag 906 0 

gacacgctga aatcaccagt ctctctctac aagatcgggg atctctagct agacgatcgt 912 0 

ttcgcatgat tgaacaagat ggattgcacg caggttctcc ggccgcttgg gtggagaggc 9180 

tattcggcta tgactgggca caacagacaa tcggctgctc tgatgccgcc gtgttccggc 9240 

tgtcagcgca ggggcgcccg gttctttttg tcaagaccga cctgtccggt gccctgaatg 9300 

aactgcagga cgaggcagcg cggctatcgt ggctggccac gacgggcgtt ccttgcgcag 9360 

ctgtgctcga cgttgtcact gaagcgggaa gggactggct gctattgggc gaagtgccgg 9420 

ggcaggatct cctgtcatct caccttgctc ctgccgagaa agtatccatc atggctgatg 9480 

caatgcggcg gctgcatacg cttgatccgg ctacctgccc attcgaccac caagcgaaac 9540 

atcgcatcga gcgagcacgt actcggatgg aagccggtct tgtcgatcag gatgatctgg 9600 

acgaagagca tcaggggctc gcgccagccg aactgttcgc caggctcaag gcgcgcatgc 9660 

ccgacggcga ggatctcgtc gtgacccatg gcgatgcctg cttgccgaat atcatggtgg 9720 

aaaatggccg cttttctgga ttcatcgact gtggccggct gggtgtggcg gaccgctatc 9780 

aggacatagc gttggctacc cgtgatattg ctgaagagct tggcggcgaa tgggctgacc 9840 

gcttcctcgt gctttacggt atcgccgctc ccgattcgca gcgcatcgcc ttctatcgcc 9900 

ttcttgacga gttcttctga gcgggactct ggggttcgat ccccaattcc cgatcgttca 9960 

aacatttggc aataaagttt cttaagattg aatcctgttg ccggtcttgc gatgattatc 10020 

atataatttc tgttgaatta cgttaagcat gtaataatta acatgtaatg catgacgtta 10080 

tttatgagat gggtttttat gattagagtc ccgcaattat acatttaata cgcgatagaa 10140 

aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc tatgttacta 10200 

gatcggggat cgggccactc gaccaagctt ctgcaggtcc tgctcgagc 10249 



<210> 15 
<211> 10312 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> unsure 

<222> 3660-3773; 4355-4407; 
<223> "n" = g, a, c, or t 



cttttcaaca aagggtaata tccggaaacc 60 

actttattgt gaagatagtg gaaaaggaag 120 

aaggaaaggc catcgttgaa gatgcctctg 180 

ccacgaggag catcgtggaa aaagaagacg 240 

gatgtgatgg tccgatgtga gacttttcaa 300 

ttccattgcc cagctatctg tcactttatt 360 

tacaaatgcc atcattgcga taaaggaaag 42 0 

ggtcccaaag atggaccccc acccacgagg 4 80 

acgtcttcaa agcaagtgga ttgatgtgat 540 



<400> 15 

ggccgcgtta actgcaggtc cgatgtgaga 
tcctcggatt ccattgccca gctatctgtc 
gtggctccta caaatgccat cattgcgata 
ccgacagtgg tcccaaagat ggacccccac 
ttccaaccac gtcttcaaag caagtggatt 
caaagggtaa tatccggaaa cctcctcgga 
gtgaagatag tggaaaagga aggtggctcc 
gccatcgttg aagatgcctc tgccgacagt 
agcatcgtgg aaaaagaaga cgttccaacc 



wo 00/26371 



14 



PCTAJS99/26086 



atctccactg acgtaaggga tgacgcacaa tcccactatc cttcgcaaga cccttcctct 600 

atataaggaa gttcatttca tttggagagg acacagaaaa atttgctaca ttgtttcaca 660 

aacttcaaat attattcatt tatttgtcag ctttcaaact ctttgtttct tgtttgttga 720 

ttgagaatac aatggcgcaa gttagcagaa tctgcaatgg tgtgcagaac ccatctctta 780 

tctccaatct ctcgaaatcc agtcaacgca aatctccctt atcggtttct ctgaagacgc 840 

agcagcatcc acgagcttat ccgatttcgt cgtcgtgggg attgaagaag agtgggatga 900 

cgttaattgg ctctgagctt cgtcctctta aggtcatgtc ttctgtttcc acggcgtgca 960 

tgcttgccat ggacaactcc gtcctgaact ctggtcgcac caccatctgc gacgcctaca 1020 

acgtcgcggc gcatgatcca ttcagcttcc agcacaagag cctcgacact gttcagaagg 1080 

agtggacgga gtggaagaag aacaaccaca gcctgtacct ggaccccatc gtcggcacgg 1140 
tggccagctt ccttctcaag aaggtcggct ctctcgtcgg gaagcgcatc ctctcggaac ' 1200 

tccgcaacct gatctttcca tctggctcca ccaacctcat gcaagacatc ctcagggaga 1260 

ccgagaagtt tctcaaccag cgcctcaaca ctgataccct tgctcgcgtc aacgctgagc 1320 

tgacgggtct gcaagcaaac gtggaggagt tcaaccgcca agtggacaac ttcctcaacc 1380 

ccaaccgcaa tgcggtgcct ctgtccatca cttcttccgt gaacaccatg caacaactgt 1440 

tcctcaaccg cttgcctcag ttccagatgc aaggctacca gctgctcctg ctgccactct 1500 

ttgctcaggc tgccaacctg cacctctcct tcattcgtga cgtgatcctc aacgctgacg 1560 

agtggggcat ctctgcagcc acgctgagga cctaccgcga ctacctgaag aactacacca 1620 

gggactactc caactattgc atcaacacct accagtcggc cttcaagggc ctcaatacga 1680 

ggcttcacga catgctggag ttcaggacct acatgttcct gaacgtgttc gagtacgtca 174 0 

gcatctggtc gctcttcaag taccagagcc tgctggtgtc cagcggcgcc aacctctacg 1800 

ccagcggctc tggtccccaa caaactcaga gcttcaccag ccaggactgg ccattcctgt 1860 

attcgttgtt ccaagtcaac tccaactacg tcctcaacgg cttctctggt gctcgcctct 1920 

ccaacacctt ccccaacatt gttggcctcc ccggctccac cacaactcat gctctgcttg 1980 

ctgccagagt gaactactcc ggcggcatct cgagcggcga cattggtgca tcgccgttca 2040 

accagaactt caactgctcc accttcctgc cgccgctgct caccccgttc gtgaggtcct 2100 

ggctcgacag cggctccgac cgcgagggcg tggccaccgt caccaactgg caaaccgagt 2160 

ccttcgagac cacccttggc ctccggagcg gcgccttcac ggcgcgtggg aattctaact 2220 

acttccccga ctacttcatc aggaacatct ctggtgttcc tctcgtcgtc cgcaacgagg 2280 

acctccgccg tccactgcac tacaacgaga tcaggaacat cgcctctccg tccgggacgc 2340 

ccggaggtgc aagggcgtac atggtgagcg tccataacag gaagaacaac atccacgctg 2400 

tgcatgagaa cggctccatg atccacctgg cgcccaatga ttacaccggc ttcaccatct 2460 

ctccaatcca cgccacccaa gtgaacaacc agacacgcac cttcatctcc gagaagttcg 2 520 

gcaaccaggg cgactccctg aggttcgagc agaacaacac caccgccagg tacaccctgc 2580 

gcggcaacgg caacagctac aacctgtacc tgcgcgtcag ctccattggc aactccacca 2640 

tcagggtcac catcaacggg agggtgtaca cagccaccaa tgtgaacacg acgaccaaca 2700 

atgatggcgt caacgacaac ggcgcccgct tcagcgacat caacattggc aacgtggtgg 2760 

ccagcagcaa ctccgacgtc ccgctggaca tcaacgtgac cctgaactct ggcacccagt 2820 

tcgacctcat gaacatcatg ctggtgccaa ctaacatctc gccgctgtac tgataggagc 2 880 

tctgatcccc atgggaattc ccgatcgttc aaacatttgg caataaagtt tcttaagatt 2940 

gaatcctgtt gccggtcttg cgatgattat catataattt ctgttgaatt acgttaagca 3000 

tgtaataatt aacatgtaat gcatgacgtt atttatgaga tgggttttta tgattagagt 3060 

cccgcaatta tacatttaat acgcgataga aaacaaaata tagcgcgcaa actaggataa 3120 

attatcgcgc gcggtgtcat ctatgttact agatcgggga tatccccggg gcggccgctc 3180 

gagtggtggc cgcatcgatc gtgaagtttc tcatctaagc ccccatttgg acgtgaatgt 3240 

agacacgtcg aaataaagat ttccgaatta gaataatttg tttattgctt tcgcctataa 3300 

atacgacgga tcgtaatttg tcgttttatc aaaatgtact ttcattttat aataacgctg 3360 

cggacatcta catttttgaa ttgaaaaaaa attggtaatt actctttctt tttctccata 3420 

ttgaccatca tactcattgc tgatccatgt agatttcccg gacatgaagc catttacaat 3480 

tgaatatatc ctgccgccgc tgccgctttg cacccggtgg agcttgcatg ttggtttcta 3540 

cgcagaactg agccggttag gcagataatt tccattgaga actgagccat gtgcaccttc 3600 

cccccaacac ggtgagcgac ggggcaacgg agtgatccac atgggacttt tcctagcttn 366 0 

nnnimnnnnn nnnnnrmiinn nnnnnnnnnn nnnnnnnnnn nnnnrmnnnn nnnnnnrmnn 3 720 

nnimnnnimn nnnccgggag ggttcgagaa gggggggcac cccccttcgg cgtgcgcggt 3780 

cacgcgccag ggcgcagccc tggttaaaaa caaggtttat aaatattggt ttaaaagcag 3840 

gttaaaagac aggttagcgg tggccgaaaa acgggcggaa acccttgcaa atgctggatt 3900 

ttctgcctgt ggacagcccc tcaaatgtca ataggtgcgc ccctcatctg tcatcactct 3960 

gcccctcaag tgtcaaggat cgcgcccctc atctgtcagt agtcgcgccc ctcaagtgtc 4 020 

aataccgcag ggcacttatc cccaggcttg tccacatcat ctgtgggaaa ctcgcgtaaa 4 080 

atcaggcgtt ttcgccgatt tgcgaggctg gccagctcca cgtcgccggc cgaaatcgag 4140 

cctgcccctc atctgtcaac gccgcgccgg gtgagtcggc ccctcaagtg tcaacgtccg 4200 
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cccctcatct gtcagtgagg gccaagtttt ccgcgtggta tccacaacgc cggcggccgg 4260 

ccgcggtgtc tcgcacacgg cttcgacggc gtttctggcg cgtttgcagg gccatagacg 4320 

gccgccagcc cagcggcgag ggcaaccagc ccggnnnnnn niinnrmnnnn nnnnnnnnnn 4380 

nnnnnnnnim nnnnnnnnnn nnnnnnngtc gatcgaccga tgcccttgag agccttcaac 4440 

ccagtcagct ccttccggtg ggcgcggggc atgactatcg tcgccgcact tatgactgtc 4500 

ttctttatca tgcaactcgt aggacaggtg ccggcagcgc tctgggtcat tttcggcgag 4 560 

gaccgctttc gctggagcgc gacgatgatc ggcctgtcgc ttgcggtatt cggaatcttg 4620 

cacgccctcg ctcaagcctt cgtcactggt cccgccacca aacgtttcgg cgagaagcag 4680 

gccattatcg ccggcatggc ggccgacgcg ctgggctacg tcttgctggc gttcgcgacg 4740 

cgaggctgga tggccttccc cattatgatt cttctcgctt ccggcggcat cgggatgccc 4800 

gcgttgcagg ccatgctgtc caggcaggta gatgacgacc atcagggaca gcttcaagga 4860 

tcgctcgcgg ctcttaccag cctaacttcg atcactggac cgctgatcgt cacggcgatt 4920 

tatgccgcct cggcgagcac atggaacggg ttggcatgga ttgtaggcgc cgccctatac 4 980 

cttgtctgcc tccccgcgtt gcgtcgcggt gcatggagcc gggccacctc gacctgaatg 5040 

gaagccggcg gcacctcgct aacggattca ccactccaag aattggagcc aatcaattct 5100 

tgcggagaac tgtgaatgcg caaaccaacc cttggcagaa catatccatc gcgtccgcca 5160 

tctccagcag ccgcacgcgg cgcatctcgg gcagcgttgg gtcctggcca cgggtgcgca 5220 

tgatcgtgct cctgtcgttg aggacccggc taggctggcg gggttgcctt actggttagc 5280 

agaatgaatc accgatacgc gagcgaacgt gaagcgactg ctgctgcaaa acgtctgcga 5340 

cctgagcaac aacatgaatg gtcttcggtt tccgtgtttc gtaaagtctg gaaacgcgga 54 00 

agtcagcgcc ctgcaccatt atgttccgga tctgcatcgc aggatgctgc tggctaccct 5460 

gtggaacacc tacatctgta ttaacgaagc gctggcattg accctgagtg atttttctct 5520 

ggtcccgccg catccatacc gccagttgtt taccctcaca acgttccagt aaccgggcat 5580 

gttcatcatc agtaacccgt atcgtgagca tcctctctcg tttcatcggt atcattaccc 5640 

ccatgaacag aaattccccc ttacacggag gcatcaagtg accaaacagg aaaaaaccgc 5700 

ccttaacatg gcccgcttta tcagaagcca gacattaacg cttctggaga aactcaacga 5760 

gctggacgcg gatgaacagg cagacatctg tgaatcgctt cacgaccacg ctgatgagct 5820 

ttaccgcagc tgcctcgcgc gtttcggtga tgacggtgaa aacctctgac acatgcagct 5880 

cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag cccgtcaggg 5940 

cgcgtcagcg ggtgttggcg ggtgtcgggg cgcagccatg acccagtcac gtagcgatag 6000 

cggagtgtat actggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat 6060 

atgcggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag gcgctcttcc 6120 

gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 6180 

cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 6240 

tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 6300 

cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 6360 

aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 6420 

cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 6480 

gcgctttctc atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag 6 540 

ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat 6600 

cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac 6660 

aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac 6720 

tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc 6780 

ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt 6840 

tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc 6900 

ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg 6960 

agattatcaa aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca 7020 

atctaaagta tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca 7080 

cctatctcag cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag 7140 

ataactacga tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac 7200 

ccacgctcac cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc 7260 

agaagtggtc ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct 7320 

agagtaagta gttcgccagt taatagtttg cgcaacgttg ttgccattgc tgcaggtcgg 7380 

gagcacagga tgacgcctaa caattcattc aagccgacac cgcttcgcgg cgcggcttaa 7440 

ttcaggagtt aaacatcatg agggaagcgg tgatcgccga agtatcgact caactatcag 7500 

aggtagttgg cgtcatcgag cgccatctcg aaccgacgtt gctggccgta catttgtacg 756 0 

gctccgcagt ggatggcggc ctgaagccac acagtgatat tgatttgctg gttacggtga 762 0 

ccgtaaggct tgatgaaaca acgcggcgag ctttgatcaa cgaccttttg gaaacttcgg 7680 

cttcccctgg agagagcgag attctccgcg ctgtagaagt caccattgtt gtgcacgacg 7 740 

acatcattcc gtggcgttat ccagctaagc gcgaactgca atttggagaa tggcagcgca 7800 

atgacattct tgcaggtatc ttcgagccag ccacgatcga cattgatctg gctatcttgc 7860 
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tgacaaaagc aagagaacat agcgttgcct tggtaggtcc agcggcggag gaactctttg 7920 

atccggttcc tgaacaggat ctatttgagg cgctaaatga aaccttaacg ctatggaact 7 980 

cgccgcccga ctgggctggc gatgagcgaa atgtagtgct tacgttgtcc cgcatttggt 8040 

acagcgcagt aaccggcaaa atcgcgccga aggatgtcgc tgaagactgg gcaatggagc 8100 

gcctgccggc ccagtatcag cccgtcatac ttgaagctag gcaggcttat cttggacaag 8160 

aagatcgctt ggcctcgcgc gcagatcagt tggaagaatt tgttcactac gtgaaaggcg 8220 

agatcaccaa ggtagtcggc aaataatgtc taacaattcg ttcaagccga cgccgcttcg 8280 

cggcgcggct taactcaagc gttagatgct gcaggcatcg tggtgtcacg ctcgtcgttt 8340 

ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg 8400 

ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg aggatttttc ggcgctgcgc 8460 

tacgtccgck accgcgttga gggatcaagc cacagcagcc cactcgacct ctagccgacc 8520 

cagacgagcc aagggatctt tttggaatgc tgctccgtcg tcaggctttc cgacgtttgg 8580 

gtggttgaac agaagtcatt atcgtacgga atgccaagca ctcccgaggg gaaccctgtg 8640 

gttggcatgc acatacaaat ggacgaacgg ataaaccttt tcacgccctt ttaaatatcc 8700 

gttattctaa taaacgctct tttctcttag gtttacccgc caatatatcc tgtcaaacac 8760 

tgatagttta aactgaaggc gggaaacgac aatctgatcc ccatcaagct tggtcgagtg 8820 

gaagctagct tcccgatcct atctgtcact tcatcaaaag gacagtagaa aaggaaggtg 8880 

gcactacaaa tgccatcatt gcgataaagg aaaggctatc gttcaagatg cctctgccga 8940 

cagtggtccc aaagatggac ccccacccac gaggagcatc gtggaaaaag aagacgttcc 9000 

aaccacgtct tcaaagcaag tggattgatg tgatacttcc actgacgtaa gggatgacgc 9060 

acaatcccac tatccttcgc aagacccttc ctctatataa ggaagttcat ttcatttgga 9120 

gaggacacgc tgaaatcacc agtctctctc tacaagatcg gggatctcta gctagacgat 9180 

cgtttcgcat gattgaacaa gatggattgc acgcaggttc tccggccgct tgggtggaga 9240 

ggctattcgg ctatgactgg gcacaacaga caatcggctg ctctgatgcc gccgtgttcc 9300 

ggctgtcagc gcaggggcgc ccggttcttt ttgtcaagac cgacctgtcc ggtgccctga 9360 

atgaactgca ggacgaggca gcgcggctat cgtggctggc cacgacgggc gttccttgcg 9420 

cagctgtgct cgacgttgtc actgaagcgg gaagggactg gctgctattg ggcgaagtgc 9480 

cggggcagga tctcctgtca tctcaccttg ctcctgccga gaaagtatcc atcatggctg 9540 

atgcaatgcg gcggctgcat acgcttgatc cggctacctg cccattcgac caccaagcga 9600 

aacatcgcat cgagcgagca cgtactcgga tggaagccgg tcttgtcgat caggatgatc 9660 

tggacgaaga gcatcagggg ctcgcgccag ccgaactgtt cgccaggctc aaggcgcgca 9720 

tgcccgacgg cgaggatctc gtcgtgaccc atggcgatgc ctgcttgccg aatatcatgg 9780 

tggaaaatgg ccgcttttct ggattcatcg actgtggccg gctgggtgtg gcggaccgct 984 0 

atcaggacat agcgttggct acccgtgata ttgctgaaga gcttggcggc gaatgggctg 9900 

accgcttcct cgtgctttac ggtatcgccg ctcccgattc gcagcgcatc gccttctatc 9960 

gccttcttga cgagttcttc tgagcgggac tctggggttc gatccccaat tcccgatcgt 10020 

tcaaacattt ggcaataaag tttcttaaga ttgaatcctg ttgccggtct tgcgatgatt 10080 

atcatataat ttctgttgaa ttacgttaag catgtaataa ttaacatgta atgcatgacg 10140 

ttatttatga gatgggtttt tatgattaga gtcccgcaat tatacattta atacgcgata 10200 

gaaaacaaaa tatagcgcgc aaactaggat aaattatcgc gcgcggtgtc atctatgtta 10260 

ctagatcggg gatcgggcca ctcgaccaag cttctgcagg tcctgctcga gc 10312 



<210> 16 
<211> 8349 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> gene 

<222> 3666-5573; 

<223> completely synthesized 



<400> 16 

gcaactgttg ggaagggcga tcggtgcggg 
ggggatgtgc tgcaaggcga ttaagttggg 
gtaaaacgac ggccagtgaa ttgcggccac 
tcacttcatc aaaaggacag tagaaaagga 
taaaggaaag gctatcattc aagatgcctc 



cctcttcgct attacgccag ctggcgaaag 60 

taacgccagg gttttcccag tcacgacgtt 120 

gcgtggtacc aagcttcccg atcctatctg 18 0 

aggtggcacc tacaaatgcc atcattgcga 240 

tgccgacagt ggtcccaaag atggaccccc 300 
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acccacgagg agcatcgtgg aaaaagaaga cgttccaacc acgtcttcaa agcaagtgga 360 

ttgatgtgat acttccactg acgtaaggga atgacgcaca atcccactat ccttcgcaag 420 

acccttcctc tatataagga agttcatttc atttggagag gacacgctga aatcaccagt 480 

ctctctctac aagatcgggg atctctagct agacgatcgt ttcgcatgat tgaacaagat 540 

ggattgcacg caggttctcc ggccgcttgg gtggagaggc tattcggcta tgactgggca 600 

caacagacaa tcggctgctc tgatgccgcc gtgttccggc tgtcagcgca ggggcgcccg 660 

gttctttttg tcaagaccga cctgtccggt gccctgaatg aactgcagga cgaggcagcg 720 

cggctatcgt ggctggccac gacgggcgtt ccttgcgcag ctgtgctcga cgttgtcact 780 

gaagcgggaa gggactggct gctattgggc gaagtgccgg ggcaggatct cctgtcatct 840 

caccttgctc ctgccgagaa agtatccatc atggctgatg caatgcggcg gctgcatacg 900 

cttgatccgg ctacctgccc attcgaccac caagcgaaac atcgcatcga gcgagcacgt 960 

actcggatgg aagccggtct tgtcgatcag gatgatctgg acgaagagca tcaggggctc 1020 

gcgccagccg aactgttcgc caggctcaag gcgcgcatgc ccgacggcga ggatctcgtc 1080 

gtgacccatg gcgatgcctg cttgccgaat atcatggtgg aaaatggccg cttttctgga 1140 

ttcatcgact gtggccggct gggtgtggcg gaccgctatc aggacatagc gttggctacc 12 00 

cgtgatattg ctgaagagct tggcggcgaa tgggctgacc gcttcctcgt gctttacggt 1260 

atcgccgctc ccgattcgca gcgcatcgcc ttctatcgcc ttcttgacga gttcttctga 1320 

gcgggactct ggggttcgaa atgaccgacc aagcgacgcc caacctgcca tcacgagatt 1380 

tcgattccac cgccgccttc tatgaaaggt tgggcttcgg aatcgttttc cgggacgccg 1440 

gctggatgat cctccagcgc ggggatctca tgctggagtt cttcgcccac ccccggatcc 1500 

ccatgggaat tcccgatcgt tcaaacattt ggcaataaag tttcttaaga ttgaatcctg 1560 

ttgccggtct tgcgatgatt atcatataat ttctgttgaa ttacgttaag catgtaataa 1620 

ttaacatgta atgcatgacg ttatttatga gatgggtttt tatgattaga gtcccgcaat 1680 

tatacattta atacgcgata gaaaacaaaa tatagcgcgc aaactaggat aaattatcgc 1740 

gcgcggtgtc atctatgtta ctagatcggg gatatccccg cggccgcgtt aacaagcttc 1800 

tgcaggtccg atgtgagact tttcaacaaa gggtaatatc cggaaacctc ctcggattcc 1860 

attgcccagc tatctgtcac tttattgtga agatagtgga aaaggaaggt ggctcctaca 1920 

aatgccatca ttgcgataaa ggaaaggcca tcgttgaaga tgcctctgcc gacagtggtc 1980 

ccaaagatgg acccccaccc acgaggagca tcgtggaaaa agaagacgtt ccaaccacgt 2040 

cttcaaagca agtggattga tgtgatggtc cgatgtgaga cttttcaaca aagggtaata 2100 

tccggaaacc tcctcggatt ccattgccca gctatctgtc actttattgt gaagatagtg 2160 

gaaaaggaag gtggctccta caaatgccat cattgcgata aaggaaaggc catcgttgaa 2220 

gatgcctctg ccgacagtgg tcccaaagat ggacccccac ccacgaggag catcgtggaa 2280 

aaagaagacg ttccaaccac gtcttcaaag caagtggatt gatgtgatat ctccactgac 2340 

gtaagggatg acgcacaatc ccactatcct tcgcaagacc cttcctctat ataaggaagt 2400 

tcatttcatt tggagaggac acgctgacaa gctgactcta gcagatctac cgtcttcggt 2460 

acgcgctcac tccgccctct gcctttgtta ctgccacgtt tctctgaatg ctctcttgtg 2520 

tggtgattgc tgagagtggt ttagctggat ctagaattac actctgaaat cgtgttctgc 2580 

ctgtgctgat tacttgccgt cctttgtagc agcaaaatat agggacatgg tagtacgaaa 2640 

cgaagataga acctacacag caatacgaga aatgtgtaat ttggtgctta gcggtattta 2700 

tttaagcaca tgttggtgtt atagggcact tggattcaga agtttgctgt taatttaggc 2760 

acaggcttca tactacatgg gtcaatagta tagggattca tattataggc gatactataa 2820 

taatttgttc gtctgcagag cttattattt gccaaaatta gatattccta ttctgttttt 2880 

gtttgtgtgc tgttaaattg ttaacgcctg aaggaataaa tataaatgac gaaattttga 2940 

tgtttatctc tgctccttta ttgtgaccat aagtcaagat cagatgcact tgttttaaat 3000 

attgttgtct gaagaaataa gtactgacag tattttgatg cattgatctg cttgtttgtt 3060 

gtaacaaaat ttaaaaataa agagtttcct ttttgttgct ctccttacct cctgatggta 3120 

tctagtatct accaactgac actatattgc ttctctttac atacgtatct tgctcgatgc 3180 

cttctcccta gtgttgacca gtgttactca catagtcttt gctcatttca ttgtaatgca 3240 

gataccaagc ggcctctaga ggatcagcat ggcgcccacc gtgatgatgg cctcgtcggc 3300 

caccgccgtc gctccgttcc tggggctcaa gtccaccgcc agcctccccg tcgcccgccg 3360 

ctcctccaga agcctcggca acgtcagcaa cggcggaagg atccggtgca tgcaggtaac 3420 

aaatgcatcc tagctagtag ttctttgcat tgcagcagct gcagctagcg agttagtaat 3480 

aggaagggaa ctgatgatcc atgcatggac tgatgtgtgt tgcccatccc atcccatccc 3540 

atttcccaaa cgaaccgaaa acaccgtact: acgtgcaggt gtggccctac ggcaacaaga 3600 

agttcgagac gctgtcgtac ctgccgccgc tgtcgaccgg cgggcgcatc cgctgcatgc 3660 

aggccatgga caactccgtc ctgaactctg gtcgcaccac catctgcgac gcctacaacg 3720 

tcgcggcgca tgatccattc agcttccagc acaagagcct cgacactgtt cagaaggagt 3780 

ggacggagtg gaagaagaac aaccacagcc tgtacctgga ccccatcgtc ggcacggtgg 384 0 

ccagcttcct tctcaagaag gtcggctctc tcgtcgggaa gcgcatcctc tcggaactcc 3900 

gcaacctgat ctttccatct ggctccacca acctcatgca agacatcctc agggagaccg 3960 
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agaagtttct caaccagcgc ctcaacactg atacccttgc tcgcgtcaac gctgagctga 4020 

cgggtctgca agcaaacgtg gaggagttca accgccaagt ggacaacttc ctcaacccca 4080 

accgcaatgc ggtgcctctg tccatcactt cttccgtgaa caccatgcaa caactgttcc 4140 

tcaaccgctt gcctcagttc cagatgcaag gctaccagct gctcctgctg ccactctttg 4200 

ctcaggctgc caacctgcac ctctccttca ttcgtgacgt gatcctcaac gctgacgagt 4260 

ggggcatctc tgcagccacg ctgaggacct accgcgacta cctgaagaac tacaccaggg 4320 

actactccaa ctattgcatc aacacctacc agtcggcctt caagggcctc aatacgaggc 4380 

ttcacgacat gctggagttc aggacctaca tgttcctgaa cgtgttcgag tacgtcagca 4440 

tctggtcgct cttcaagtac cagagcctgc tggtgtccag cggcgccaac ctctacgcca 4500 

gcggctctgg tccccaacaa actcagagct tcaccagcca ggactggcca ttcctgtatt 4560 

cgttgttcca agtcaactcc aactacgtcc tcaacggctt ctctggtgct cgcctctcca 4620 

acaccttccc caacattgtt ggcctccccg gctccaccac aactcatgct ctgcttgctg 4680 

ccagagtgaa ctactccggc ggcatctcga gcggcgacat tggtgcatcg ccgttcaacc 4740 

agaacttcaa ctgctccacc ttcctgccgc cgctgctcac cccgttcgtg aggtcctggc 4800 

tcgacagcgg ctccgaccgc gagggcgtgg ccaccgtcac caactggcaa accgagtcct 4860 

tcgagaccac ccttggcctc cggagcggcg ccttcacggc gcgtgggaat tctaactact 4920 

tccccgacta cttcatcagg aacatctctg gtgttcctct cgtcgtccgc aacgaggacc 4980 

tccgccgtcc actgcactac aacgagatca ggaacatcgc ctctccgtcc gggacgcccg 504 0 

gaggtgcaag ggcgtacatg gtgagcgtcc ataacaggaa gaacaacatc cacgctgtgc 5100 

atgagaacgg ctccatgatc cacctggcgc ccaatgatta caccggcttc accatctctc 5160 

caatccacgc cacccaagtg aacaaccaga cacgcacctt catctccgag aagttcggca 5220 

accagggcga ctccctgagg ttcgagcaga acaacaccac cgccaggtac accctgcgcg 5280 

gcaacggcaa cagctacaac ctgtacctgc gcgtcagctc cattggcaac tccaccatca 5340 

gggtcaccat caacgggagg gtgtacacag ccaccaatgt gaacacgacg accaacaatg 5400 

atggcgtcaa cgacaacggc gcccgcttca gcgacatcaa cattggcaac gtggtggcca 5460 

gcagcaactc cgacgtcccg ctggacatca acgtgaccct gaactctggc acccagttcg 5520 

acctcatgaa catcatgctg gtgccaacta acatctcgcc gctgtactga taggagctct 5580 

gatccccatg ggaattcccg atcgttcaaa catttggcaa taaagtttct taagattgaa 5640 

tcctgttgcc ggtcttgcga tgattatcat ataatttctg ttgaattacg ttaagcatgt 5700 

aataattaac atgtaatgca tgacgttatt tatgagatgg gtttttatga ttagagtccc 5760 

gcaattatac atttaatacg cgatagaaaa caaaatatag cgcgcaaact aggataaatt 5820 

atcgcgcgcg gtgtcatcta tgttactaga tcggggatat ccccggggcg gccgcgggga 5880 

attcggtacc aagcttacgc gtggccgcag cttggcgtaa tcatggtcat agctgtttcc 5940 

tgtgtgaaat tgttatccgc tcacaattcc acacaacata cgagccggaa gcataaagtg 6000 

taaagcctgg ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc 6060 

cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg 6120 

gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 6180 

ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 6240 

agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 63 00 

ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 6360 

caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 6420 

gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 6480 

cctgtccgcc tttctccctt cgggaagcgt ggcgctttct caatgctcac gctgtaggta 6540 

tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 6600 

gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 6660 

cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 672 0 

tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg 6780 

tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 6840 

caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 6900 

aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 6960 

cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 7020 

ccttttgggg tgggcgaaga actccagcat gagatccccg cgctggagga tcatccagcc 7080 

ggcgtcccgg aaaacgattc cgaagcccaa cctttcatag aaggcggcgg tggaatcgaa 7140 

atctcgtgat ggcaggttgg gcgtcgcttg gtcggtcatt tcgaacccca gagtcccgct 7200 

cagaagaact cgtcaagaag gcgatagaag gcgatgcgct gcgaatcggg agcggcgata 7260 

ccgtaaagca cgaggaagcg gtcagcccat tcgccgccaa gctcttcagc aatatcacgg 7320 

gtagccaacg ctatgtcctg atagcggtcc gccacaccca gccggccaca gtcgatgaat 7380 

ccagaaaagc ggccattttc caccatgata ttcggcaagc aggcatcgcc atgggtcacg 744 0 

acgagatcct cgccgtcggg catgcgcgcc ttgagcctgg cgaacagttc ggctggcgcg 7500 

agcccctgat gctcttcgtc cagatcatcc tgatcgacaa gaccggcttc catccgagta 7560 

cgtgctcgct cgatgcgatg tttcgcttgg tggtcgaatg ggcaggtagc cggatcaagc 7620 
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gtatgcagcc gccgcattgc atcagccatg atggatactt tctcggcagg agcaaggtga 7680 

gatgacagga gatcctgccc cggcacttcg cccaatagca gccagtccct tcccgcttca 7740 

gtgacaacgt cgagcacagc tgcgcaagga acgcccgtcg tggccagcca cgatagccgc 7800 

gctgcctcgt cctgcagttc attcagggca ccggacaggt cggtcttgac aaaaagaacc 7860 

gggcgcccct gcgctgacag ccggaacacg gcggcatcag agcagccgat tgtctgttgt 7920 

gcccagtcat agccgaatag cctctccacc caagcggccg gagaacctgc gtgcaatcca 7980 

tcttgttcaa tcatgcgaaa cgatcctcat cctgtctctt gatcagatct tgatcccctg 8040 

cgccatcaga tccttggcgg caagaaagcc atccagttta ctttgcaggg cttcccaacc 8100 

ttaccagagg gcgccccagc tggcaattcc ggttcgcttg ctgtccataa aaccgcccag 8160 

tctagctatc gccatgtaag cccactgcaa gctacctgct ttctctttgc gcttgcgttt 8220 

tcccttgtcc agatagccca gtagctgaca ttcatccggg gtcagcaccg tttctgcgga 8280 

ctggctttct acgtgttccg cttcctttag cagcccttgc gccctgagtg cttgcggcag 8340 

cgtgaagct 33^^ 



<210> 17 
<211> 1912 
<212> DNA 

<213> Bacillus thuringiensis 



<400> 17 

atgaatagtg tattgaatag cggaagaact actatttgtg atgcgtataa tgtagcggct 60 

catgatccat ttagttttca acacaaatca ttagataccg tacaaaagga atggacggag 120 

tggaaaaaaa ataatcatag tttataccta gatcctattg ttggaactgt ggctagtttt iso 

ctgttaaaga aagtggggag tcttgttgga aaaaggatac taagtgagtt acggaattta 240 

atatttccta gtggtagtac aaatctaatg caagatattt taagagagac agaaaaattc 300 

ctgaatcaaa gacttaatac agacactctt gcccgtgtaa atgcggaatt gacagggctg 360 

caagcaaatg tagaagagtt taatcgacaa gtagataatt ttttgaaccc taaccgaaac 420 

gctgttcctt tatcaataac ttcttcagtt aatacaatgc aacaattatt tctaaataga 480 

ttaccccagt tccagatgca aggataccaa ctgttattat tacctttatt tgcacaggca 54 0 

gccaatttac atctttcttt tattagagat gttattctaa atgcagatga atggggaatt 6 00 

tcagcagcaa cattacgtac gtatcgagat tacttgaaaa attatacaag agattactct 660 

aactattgta taaatacgta tcaaagtgcg tttaaaggtt taaacactcg tttacacgat 720 

atgttagaat ttagaacata tatgttttta aatgtatttg agtatgtatc tatctggtcg 780 

ttgtttaaat atcaaagtct tctagtatct tccggtgcta atttatatgc aagtggtagt 840 

ggaccacagc agacccaatc atttacttca caagactggc catttttata ttctcttttc 900 

caagttaatt caaattatgt gttaaatgga tttagtggtg ctaggctttc taataccttc 960 

cctaatatag ttggtttacc tggttctact acaactcacg cattgcttgc tgcaagggtt 1020 

aattacagtg gaggaatttc gtctggtgat ataggtgcat ctccgtttaa tcaaaatttt 1080 

aattgtagca catttctccc cccattgtta acgccatttg ttaggagttg gctagattca 1140 

ggttcagatc gggagggcgt tgccaccgtt acaaattggc aaacagaatc ctttgagaca 1200 

actttagggt taaggagtgg tgcttttaca gctcgcggta attcaaacta tttcccagat 1260 

tattttattc gtaatatttc tggagttcct ttagttgtta gaaatgaaga tttaagaaga 1320 

ccgttacact ataatgaaat aagaaatata gcaagtcctt caggaacacc tggtggagca 1380 

cgagcttata tggtatctgt gcataacaga aaaaataata tccatgctgt tcatgaaaat 144 o 

ggttctatga ttcatttagc gccaaatgac tatacaggat ttactatttc gccgatacat 1500 

gcaactcaag tgaataatca aacacgaaca tttatttctg aaaaatttgg aaatcaaggt 1560 

gattctttaa ggtttgaaca aaacaacacg acagctcgtt atacgcttag agggaatgga 1620 

aatagttaca atctttattt aagagtttct tcaataggaa attccactat tcgagttact 1680 

ataaacggta gggtatatac tgctacaaat gttaatacta ctacaaataa cgatggagtt 1740 

aatgataatg gagctcgttt ttcagatatt aatatcggta atgtagtagc aagtagtaat I8OO 

tctgatgtac cattagatat aaatgtaaca ttaaactccg gtactcaatt tgatcttatg i860 

aatattatgc ttgtaccaac taatatttca ccactttatt aaggtttgag ta 1912 

<210> 18 
<211> 633 
<212> PRT 

<213> Bacillus thuringiensis 



<400> 18 
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Met Asn Ser Val Leu Asn Ser Gly Arg Thr Thr He Cys Asp Ala Tyr 

15 10 15 

Asn Val Ala Ala His Asp Pro Phe Ser Phe Gin His Lys Ser Leu Asp 

20 25 30 

Thr Val Gin Lys Glu Trp Thr Glu Trp Lys Lys Asn Asn His Ser Leu 

35 40 45 

Tyr Leu Asp Pro He Val Gly Thr Val Ala Ser Phe Leu Leu Lys Lys 

50 55 60 

Val Gly Ser Leu Val Gly Lys Arg He Leu Ser Glu Leu Arg Asn Leu 
65 70 75 80 

He Phe Pro Ser Gly Ser Thr Asn Leu Met Gin Asp He Leu Arg Glu 

85 90 95 

Thr Glu Lys Phe Leu Asn Gin Arg Leu Asn Thr Asp Thr Leu Ala Arg 

100 105 110 

Val Asn Ala Glu Leu Thr Gly Leu Gin Ala Asn Val Glu Glu Phe Asn 

115 120 125 

Arg Gin Val Asp Asn Phe Leu Asn Pro Asn Arg Asn Ala Val Pro Leu 

130 135 140 

Ser He Thr Ser Ser Val Asn Thr Met Gin Gin Leu Phe Leu Asn Arg 
145 150 155 160 

Leu Pro Gin Phe Gin Met Gin Gly Tyr Gin Leu Leu Leu Leu Pro Leu 

165 170 175 

Phe Ala Gin Ala Ala Asn Leu His Leu Ser Phe He Arg Asp Val He 

180 185 190 

Leu Asn Ala Asp Glu Trp Gly He Ser Ala Ala Thr Leu Arg Thr Tyr 

195 200 205 

Arg Asp Tyr Leu Lys Asn Tyr Thr Arg Asp Tyr Ser Asn Tyr Cys He 

210 215 220 

Asn Thr Tyr Gin Ser Ala Phe Lys Gly Leu Asn Thr Arg Leu His Asp 
225 230 235 240 

Met Leu Glu Phe Arg Thr Tyr Met Phe Leu Asn Val Phe Glu Tyr Val 

245 250 255 

Ser He Trp Ser Leu Phe Lys Tyr Gin Ser Leu Leu Val Ser Ser Gly 

260 265 270 

Ala Asn Leu Tyr Ala Ser Gly Ser Gly Pro Gin Gin Thr Gin Ser Phe 

275 280 285 

Thr Ser Gin Asp Trp Pro Phe Leu Tyr Ser Leu Phe Gin Val Asn Ser 

290 295 300 

Asn Tyr Val Leu Asn Gly Phe Ser Gly Ala Arg Leu Ser Asn Thr Phe 
305 310 315 320 

Pro Asn He Val Gly Leu Pro Gly Ser Thr Thr Thr His Ala Leu Leu 

325 330 335 

Ala Ala Arg Val Asn Tyr Ser Gly Gly He Ser Ser Gly Asp He Gly 

340 345 350 

Ala Ser Pro Phe Asn Gin Asn Phe Asn Cys Ser Thr Phe Leu Pro Pro 

355 360 365 

Leu Leu Thr Pro Phe Val Arg Ser Trp Leu Asp Ser Gly Ser Asp Arg 

370 375 380 

Glu Gly Val Ala Thr Val Thr Asn Trp Gin Thr Glu Ser Phe Glu Thr 
385 390 395 400 

Thr Leu Gly Leu Arg Ser Gly Ala Phe Thr Ala Arg Gly Asn Ser Asn 

405 410 415 

Tyr Phe Pro Asp Tyr Phe He Arg Asn He Ser Gly Val Pro Leu Val 

420 425 430 

Val Arg Asn Glu Asp Leu Arg T^g Pro Leu His Tyr Asn Glu He Arg 

435 440 445 

Asn He Ala Ser Pro Ser Gly Thr Pro Gly Gly Ala Arg Ala Tyr Met 

450 455 460 

Val Ser Val His Asn Arg Lys Asn Asn He His Ala Val His Glu Asn 
465 470 475 480 

Gly Ser Met He His Leu Ala Pro Asn Asp Tyr Thr Gly Phe Thr He 



wo 00/26371 



21 



PCT/US99/26086 











485 










490 










495 




Ser 


Pro 


He 


His 


Ala 


Thr 


Gin 


Val 


Asn 


Asn 


Gin 


Thr 


Arg 


Thr 


Phe 


He 








500 










505 








510 






Ser 


Glu 


Lys 
515 


Phe 


Gly 


Asn 


Gin 


Gly 
520 


Asp 


Ser 


Leu 


Arg 


Phe 
525 


Glu 


Gin 


Asn 


Asn 


Thr 


Thr 


Ala 


Arg 


Tyr 


Thr 


Leu 


Arg 


Gly 


Asn 


Gly 


Asn 


Ser 


Tyr 


Asn 




530 










535 










540 








Leu 


Tyr 


Leu 


Arg 


Val 


Ser 


Ser 


He 


Gly 


Asn 


Ser 


Thr 


He 


Arg 


Val 


Thr 


545 










550 










555 








560 


He 


Asn 


Gly 


Arg 


Val 
565 


Tyr 


Thr 


Ala 


Thr 


Asn 
570 


Val 


Asn 


Thr 


Thr 


Thr 
575 


Asn 


Asn 


Asp 


Gly 


Val 


Asn 


Asp 


Asn 


Gly 


Ala 


Arg 


Phe 


Ser 


Asp 


He 


Asn 


He 








580 










585 








590 






Gly 


Asn 


Val 


Val 


Ala 


Ser 


Ser 


Asn 


Ser 


Asp 


Val 


Pro 


Leu 


Asp 


He 


Asn 






595 










600 










605 






Val 


Thr 
610 


Leu 


Asn 


Ser 


Gly 


Thr 
615 


Gin 


Phe 


Asp 


Leu 


Met 
620 


Asn 


He 


Met 


Leu 


Val 


Pro 


Thr 


Asn 


He 


Ser 


Pro 


Leu 


Tyr 

















625 630 



INTERNATIONAL SEARCH REPORT 



iNo 

PCT/US 99/26086 



A. OJISSRCATION OP 8UaCCT MATTER 

IPC 7 C12N15/12 C12N15/82 C12N5/10 A01H5/00 



AoooitikHltetnteiwdtenalPBflertaaeaWcalto 



AMnifndoounentadonaeaiched (classHteaflongyatemfoiowftdbyclaagffloallon gymbote) 

IPC 7 AOIH C07K C12N 



Docunenterikmaeeithed ether than mk^^ In the fields searched 



BeotionlcdatalXMeoofmitedduilnglhe Memellonaleeeioh (name of data Imm and. whefepracflcalaearohtefmsuaed) 



a D0CUBENT8 CONSIDERED TO BE RELEVANT 



Categay • Citation oTdocunent, wtth hdcaHon, whem appropriate, of the relevant paasagee 



Relevant to dakn No. 



WONG E Y ET AL: "ARABIDOPSIS THALIANA 
SMALL SUBUNIT LEADER AND TRANSIT PEPTIDE 
ENHANCE THE EXPRESSION OF BACILLUS 
THURINGIENSIS PROTEINS IN TRANSGENIC 
PLANTS" 

PLANT MOLECULAR BIOLOGY, NL,NIJHOFF 
PUBLISHERS, DORDRECHT, 
vol. 20, no. 1, 

1 October 1992 (1992-10-01), pages 81-93, 
XP000570449 

ISSN: 0167-4412 
the whole document 



2,4,6,9, 

11.13. 

15-26, 

28,30, 

32,34, 

36,38, 

40-56 



p(] Further doctinento are Istedfri the ocntinuallcn of box a 


|X \ Patent family members aielstedhamex. 


* Special categories ol dted docunente : 

"A" docimentdeAnk>g1he general state oflhe art which teno« 
considered to be of particular relevance 


T" later document ptfcllstied after the htematlonal flilig date 
or priority date and not hoonflktvrith the applcaHon but 
dted to itiderstand the principle or theory inderfylng the 
hnvenllon 

'KT docunent of periJciiarielevance; the claimed hiventlon 
camot be considered novel or cannot be coneklered to 
Involve an Inventive step when the document Is taken alone 

"Y" docunent of paiUctJarralevdnce; the claimed Invention 
oamot be considered to Involve an lnver\ltve stepwhenthe 
docunent Is combined with one or more other such docu- 
ments, such oombkwiton being obvious to a pefsonskUed 
IntheaiL 

*&" docunent member of the same paterttfamly 


"E" eeiller docunent but pubiahed on or after ttie htemallonat 
fling date 

V docunent wtilchrnay throw doitte on priority clakn(s)or 
\Aihlch Is cHed to eetabilah the pUillcaion date of another 
dtattcn or ctfier special reason (as specMed) 

"0" dooumentrefenfngtoanoraldsdoeure.uae, exHbMonor 
cttMfineans 

"P" docunent piMahed prior to the bTtemadonal flft^gdatebut 
later than the priority date dabned 


Date of the actual complelkn of the IntetnatkNTOl search 


Date of maling of the International search report 


7 Harch 2000 


24/03/2000 


Name and maflkig address of the ISA 


Authorized offloer 


European Patent Office* RB. 5818 Potenltaan 2 
NL~2280HVRIj8%«4|c 
Tel. (431.70) a40-204a Tx. 31 661 epo r^. 
Fax: (431-70) 340-3016 


Burkhardt, P 



Pom PCT/t8Afi10 (aceand chMt) (Aty 1M2) 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 



tnt Sonal Appfiootion No 

PCT/US 99/26086 



a<Oontiitiation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Cateooiy* Cttadon of docunent, with h<icailon.wher» appropriate, of the releveyit pasaagea 



Relevant to dakn No. 



MIDNER U R ET AL: "TWO HIGHLY RELATED 
INSECTICIDAL CRYSTAL PROTEINS OF BACILLUS 
THURINGIENSIS SUBSP. KURSTAKI POSSESS 
DIFFERENT HOST RANGE SPECIFICITIES" 
JOURNAL OF BACTERIOLOGY, US, WASHINGTON, DC, 
vol. 171, no. 2, 

1 February 1989 (1989-02-01), pages 
965-974, XP002071367 

ISSN: 0021-9193 

figure 2 

UO 95 24492 A (CALGENE INC ; STATE 
UNIVERSITY OF NEW JERSEY (US)) 
14 September 1995 (1995-09-14) 



page 13, lines 30-32 

page 15, line 7 -page 16, line 9; examples 
4.5 

US 5 689 052 A (SANDERS PATRICIA RIGDEN 
ET AL) 18 November 1997 (1997-11-18) 
SEQ ID NO: 2 

EP 0 385 962 A (MONSANTO CO) 
5 September 1990 (1990-09-05) 
claim 2; figure 13 

CRICKMORE N ET AL: "Revision of the 
nomenclature for the Bacillus 
thurlnglensis pestlcldal crystal 
proteins." 

MICROBIOLOGY AND HOLECULAR BIOLOGY 
REVIEWS, (1998 SEP) 62 (3) 807-13. REF: 
126 , XP002132141 
page 809 -page 810 

KOTA M ET AL: "Overexpresslon of the 
Bacillus thurlnglensis (Bt) Cry2Aa2 
protein in chloroplasts confers resistance 
to plants against susceptible and 
Bt-reslstant insects." 
PROCEEDINGS OF THE NATIONAL ACADEMY OF 
SCIENCES OF THE UNITED STATES OF AMERICA, 
(1999 MAR 2) 96 (5) 1840-5. , XP002132142 
the whole document 



1-56 



1,3,7,8, 

10,12, 

14,27, 

29,31, 

33,35, 

37,39 



1-56 



1-56 



1-56 



1,3.7,8, 

10,12, 

14,27, 

29,31, 

33,35, 

37,39 



Rrni PCr/nAA10(eentaii«lcne«Meend AM) (Jdy IMS) 



page 2 of 2 



INTERNATIONAL SEARCH REPORT 

infoniMtion on |Krt«m tam^ tiMfiibcrt 



|nta ^ OMI AppliMiion M> 

PCT/US 99/26086 



Patent document 
died bieeaid) report 



Pit)Ocatfc>n 
dat 



Patent famRy 
member(8) 



Pil^catlon 
date 



WO 9524492 


A 




lie 


CCilCOlO A 

b54bol8 A 


13-08-1996 








TA 
tA 


OlOCOilA A 

Z1od340 a 


14-09-1995 








CO 

tr 


0749490 A 


27-12-1996 








ID 

vr 


IOdIIZoI t 


04-11-1998 


US 5689052 


A 




UAur 
NONc 






EP 0385962 


A 


OR— no— 1 Qon 


All 

AU 


638438 B 


01-07-1993 








All 

AU 


5163090 A 


26-09-1990 








CA 


2024811 A 


25-08-1990 








EP 


0413019 A 


20-02-1991 








IL 


93513 A 


14-11-1995 








JP 


3504333 T 


26-09-1991 








NO 


303546 B 


27-07-1998 








NZ 


232654 A 


23-12-1991 








UO 


9010076 A 


07-09-1990 








US 


5880275 A 


0^3-1999 








US 


5500365 A 


19-03-1996 








ZA 


9001417 A 


30-01-1991 








TR 


24354 A 


01-09-1991 



Form PCT/l8Aftio {ptmnfmiftnm^ (JUy logo) 



